scieee Science in your language
[en] (orig)
Geometric Complexity Theory
and Orbit Closures of Homogeneous Forms
vorgelegt von
Diplom-Mathematiker
Jesko Hüttenhain
aus Siegen
Von der Fakultät II - Mathematik und Naturwissenschaften
der Technischen Universität Berlin
zur Erlangung des akademischen Grades
Doktor der Naturwissenschaften
Dr. rer. nat.
genehmigte Dissertation
Promotionsausschuss
Vorsitzender: Prof. Dr. Jörg Liesen
Gutachter : Prof. Dr. Peter Bürgisser
Gutachter : Prof. Dr. Giorgio Ottaviani
Tag der wissenschaftlichen Aussprache: 3. Juli 2017
Berlin 2017
Niemals aufgeben, niemals kapitulieren.
Peter Quincy Taggart
Deutsche Einleitung
Das P-NP-Problem gehört zu den fundamentalsten und faszinierensten Problemen der
heutigen Mathematik. Es hat umfangreiche Bedeutung für praktische Anwendungen
und ist gleichermaßen eine grundsätzliche Frage über die Natur der Mathematik an
sich. Wäre unerwartet P=NP, so könnte etwa ein Computer effizient bestimmen,
ob eine mathematische Aussage wahr oder falsch ist. Seit die Frage 1971 von Cook
[Coo71] gestellt wurde, scheinen wir einer Antwort jedoch nicht nennenswert näher
gekommen zu sein. Der größte Fortschritt ist das ernüchternde Resultat von Razborov
und Rudich [RR97], dass es keine „natürlichen“ Beweise für P=NP geben kann, siehe
deren Arbeit für eine Definition und Details.
Peter Bürgisser hat gezeigt [Bü00], dass unter der verallgemeinerten Riemann-
Hypothese die nicht-uniforme Version von P=NP eine Vermutung von Valiant im-
pliziert, welche weithin als ein algebraisches Analogon betrachtet wird. Diese Vermu-
tung ist ein ebenso offenes Problem wie das ursprüngliche, doch es gibt die Hoffnung,
dass die zusätzliche algebraische Struktur mehr Ansatzpunkte liefert. Wir geben einen
kurzen Überblick über die zugrundeliegende Theorie in Kapitel 1: Die besagte Ver-
mutung von Valiant (Vermutung 1.4.5) ist die zentrale Motivation für die hier vorge-
stellten Forschungsergebnisse.
In Kapitel 2verschärfen wir ein Resultat von Valiant indem wir zeigen, dass sich
jedes ganzzahlige Polynom stets als Determinante einer Matrix schreiben lässt, de-
ren Einträge nur Variablen, Nullen und Einsen sind. The Größe der kleinsten solchen
Matrix ist ein sinnvolles Komplexitätsmaß, welches sich rein kombinatorisch untersu-
chen lässt. Als Anwendung beweisen wir untere Schranken in kleinen Fällen durch
Computerberechnung.
Von hier an widmen wir uns einem Ansatz zum Beweis von Valiant’s Vermutung,
welcher 2001 von Mulmuley und Sohoni vorgestellt wurde und den Titel „Geometric
Complexity Theory“ trägt, oder auch kurz GCT. Valiants Ergebnisse werden hier in
Aussagen der Algebraischen Geometrie und Darstellungstheorie übersetzt, um da-
durch die Probleme vorheriger Ansätze zu vermeiden [For09]. Eine Zusammenfas-
sung dieses interessanten Übersetzungsprozesses ist in Kapitel 3zu finden.
iii
Um Komplexitätsklassen voneinander zu trennen, muss bewiesen werden, dass
kein effizienter Algorithmus für ein mutmaßlich schweres Problem existiert. GCT lie-
fert ein Kriterium, wonach die Existenz gewisser ganzzahliger Vektoren schon diese
Trennung impliziert. Solche Vektoren nennen wir auch Obstruktionen. Die Hoffnung
von Mulmuley und Sohoni war, dass man sich für den Beweis von Valiants Vermu-
tung auf eine bestimmte Art von Obstruktionen beschränken kann, doch wir konn-
ten bereits 2015 bemerken, dass es starke Anzeichen dagegen gibt. Wir stellen diese
Ergebnisse in Kapitel 4vor. Erst kürzlich haben Bürgisser, Ikenmeyer und Panova
schlussendlich gezeigt, dass diese sogenannten occurrence obstructions nicht ausrei-
chen, um Valiants Vermutung zu beweisen.
Im zweiten Teil der Arbeit beschäftigen wir uns näher und in größerer Allgemein-
heit mit einem der zentralen Konzepte von GCT: Wir betrachten die Wirkung der GLn
auf dem Raum der homogenen, n-variaten Polynome vom Grad ddurch Verkettung
von rechts und beobachten, dass alle Elemente einer Bahn im Wesentlichen die gleiche
Berechnungskomplexität haben. Man ordnet einem Polynom Pdas kleinste dzu, so
dass Pim Bahnabschluss des d×d Determinantenpolynoms liegt. Diese Kennzahl
ist dann äquivalent zu Valiants ursprünglichem Komplexitätsmaß für Polynome, so-
fern auch Approximationen zugelassen werden. Die Geometrie des Bahnabschlusses
der Determinante ist wenig verstanden, sogar für kleine Werte von d. Wir können al-
lerdings die im Rand auftretenden Komponenten für d=3 vollständig klassifizieren.
Die benötigten algebraischen und geometrischen Werkzeuge werden im einfüh-
renden Kapitel 5vorgestellt. Bahnabschlüsse von homogenen Formen sind stets al-
gebraische Varietäten mit der oben genannten GLn-Wirkung und werden klassisch
in der geometrischen Invariantentheorie studiert. Neben der Determinante beschäfti-
gen wir uns in den darauffolgenden Kapiteln auch mit den Bahnabschlüssen anderer
homogener Formen.
In Kapitel 6behandeln wir das allgemeine Monom x1···xd, dessen Relevanz für
GCT daher stammt, dass es auch als Einschränkung des Determinantenpolynoms auf
Diagonalmatrizen verstanden werden kann. Das Studium seines Bahnabschlusses ist
beispielsweise das zentrale Hilfsmittel in Kapitel 4. Wir bemerken in diesem Kapitel,
dass das Monom die seltene Eigenschaft hat, dass jedes Polynom in seinem Bahnab-
schluss lediglich das Ergebnis einer Variablensubstitution ist. Eine Klassifikation aller
Polynome mit dieser Eigenschaft bleibt offen, obwohl wir einige Fragen im Hinblick
darauf beantworten können.
Wir stellen für die beiden abschließenden Kapitel noch weitere Techniken in Kapi-
tel 7vor. Maßgeblich ist eine obere Schranke für die Anzahl irreduzibler Komponen-
ten des Randes eines Bahnabschlusses. Die erste Anwendung dieser Aussage liefert
eine Beschreibung des Randes für det3in Kapitel 8. Wir bestimmen hier auch den
Stabilisator der Determinante einer allgemeinen, spurlosen Matrix und können so
iv
schlussfolgern, dass der Bahnabschluss dieses Polynoms stets eine Komponente im
Rand des Bahnabschlusses der Determinante ist.
Das letzte Kapitel enthält bisher unveröffentlichte Ergebnisse über das allgemeine
Binom x1···xd+y1···yd. Wie auch das Monom ist das Binom im Bahnabschluss
von detdenthalten. Ein solides Verständnis dieser Familie homogener Formen sollte
daher Voraussetzung dafür sein, den Bahnabschluss des Determinantenpolynoms im
Allgemeinen zu studieren. Die hier aufkommenden geometrischen Fragen sind bereits
deutlich komplexer als im Fall des Monoms: Wir können zwei Komponenten des
Randes im Detail beschreiben, müssen jedoch eine subtile Frage offen lassen. Sofern
diese sich positiv beantworten lässt, erhalten wir jedoch so bereits eine vollständige
Beschreibung des Randes.
v
Contents
Acknowledgements 1
Introduction 3
I Geometric Complexity Theory 7
1 Algebraic Complexity Theory 9
1.1 Arithmetic Circuits ............................... 9
1.2 The Classes VP and P ............................. 12
1.3 Reduction and Completeness ......................... 13
1.4 The Classes VNP and NP ........................... 14
1.5 Determinant Versus Permanent ........................ 16
2 Binary Determinantal Complexity 21
2.1 The Cost of Computing Integers ....................... 22
2.2 Lower Bounds .................................. 26
2.3 Uniqueness of Grenet’s construction in the 7 ×7 case ........... 29
2.4 Algebraic Complexity Classes ......................... 30
2.5 Graph Constructions for Polynomials .................... 31
3 Geometric Complexity Theory 35
3.1 Orbit and Orbit Closure as Complexity Measure .............. 35
3.2 Border Complexity ............................... 37
3.3 The Flip via Obstructions ........................... 39
3.4 Orbit and Orbit Closure ............................ 41
4 Occurrence Obstructions 45
4.1 Weight Semigroups ............................... 45
4.2 Saturations of Weight Semigroups of Varieties ............... 47
4.3 Proof of Main Results ............................. 49
vii
II Orbit Closures of Homogeneous Forms 55
5 Preliminaries 57
5.1 Conciseness ................................... 58
5.2 Grading of Coordinate Rings and Projectivization ............. 60
5.3 Rational Orbit Map ............................... 61
6 Closed Forms 65
6.1 A Sufficient Criterion .............................. 65
6.2 Normalizations of Orbit Closures ....................... 67
6.3 Proof of Main Theorem ............................ 71
7 Techniques for Boundary Classification 77
7.1 Approximating Degenerations ........................ 77
7.2 The Lie Algebra Action ............................. 80
7.3 Resolving the Rational Orbit Map ...................... 82
8 The 3 by 3 Determinant Polynomial 89
8.1 Construction of Two Components of the Boundary ............ 90
8.2 There Are Only Two Components ...................... 91
8.3 The Traceless Determinant ........................... 96
8.4 The Boundary of the 4 ×4 Determinant ................... 102
9 The Binomial 107
9.1 Stabilizer and Maximal Linear Subspaces .................. 108
9.2 The First Boundary Component ....................... 110
9.3 The Second Boundary Component ...................... 116
9.4 The Indeterminacy Locus ........................... 118
III Appendix 123
A Algebraic Groups and Representation Theory 125
A.1 Algebraic Semigroups and Groups ...................... 125
A.2 Representation Theory of Reductive Groups ................ 130
A.3 Polynomial Representations .......................... 139
Bibliography 141
List of Symbols 149
Index 153
viii
Acknowledgements
I had the immense privilege to work in the most comfortable scientific environment
imaginable, having both the freedom to pursue my interests and the support of my
advisor, Peter Bürgisser. I want to thank him heartily for providing this and also
for the many things that I learned from him. Researching under these great con-
ditions was also made possible by the grants BU 1371/2-2 and BU 1371/3-2 of the
Deutsche Forschungsgemeinschaft. Their support is highly appreciated. In late 2014 I
participated in the program on Algorithms and Complexity in Algebraic Geometry at the
Simons Institute with great benefit. I am very grateful for this opportunity and extend
my thanks to the organizers Peter Bürgisser, Joseph Landsberg, Ketan Mulmuley and
Bernd Sturmfels.
I also had the pleasure to enjoy the company of extraordinary colleagues, two of
which are also my coauthors. I thank Christian Ikenmeyer and Pierre Lairez for the
confusion we shared, the time we spent pondering and the insights we could claim
together.
My parents have given me more than words could ever say, and my gratitude for
all their love and support is eternal.
Ever since a memorable chess game in late 2004, I am part of a friendship that
particularly sustained me throughout the years. I want to thank Nikolai Nowaczyk
for sharing it with me across time, space, and the fall of empires.
1
Introduction
The Pvs. NP problem is among the deepest and most intriguing questions of math-
ematics today. While having a multitude of implications for practical applications, it
also carries fundamental questions in its wake, about the nature of mathematics itself.
If for example P=NP would hold unexpectedly, then we could program a machine
to efficiently determine the truth of mathematical statements. Introduced in 1971 by
Cook [Coo71], the problem is already 46 years old and we seem as far from the so-
lution as ever. The most notable progress so far is the sobering result by Razborov
and Rudich [RR97] that no “natural” proof for P=NP exists, see their work for a
definition and details.
Peter Bürgisser has shown [Bü00] that the nonuniform version of P=NP im-
plies, under the generalized Riemann hypothesis, a conjecture by Valiant which is
widely considered an algebraic analogue. It remains as unresolved as the former,
even though the additional algebraic structure involved is believed to provide better
points of vantage. We give a brief review of the underlying theory in Chapter 1 It
is the said conjecture by Valiant (Conjecture 1.4.5) that serves as motivation for the
research presented herein.
In Chapter 2, we slightly strengthen a result by Valiant: The core observation is
that integer polynomials can always be written as the determinant of a matrix whose
entries are variables, zeros and ones. The size of the smallest such matrix then gives a
reasonable complexity measure which at the same time is accessible to combinatorics.
As an application, we can provide lower bounds in small cases by computational
methods.
From there on, we are concerned with a recent approach to Valiant’s Conjecture
known as Geometric Complexity Theory, or GCT for short. Introduced in 2001 by
Mulmuley and Sohoni, it avoids the difficulties of many previous attempts [For09] by
translating Valiant’s results to statements in algebraic geometry and representation
theory. An outline of this enticing transition is given in Chapter 3.
Where complexity theory is clasically concerned with proving the nonexistence of
good algorithms for supposedly hard problems, GCT provides a criterion whereby the
3
existence of certain integer vectors implies a separation of complexity classes. We refer
to the vectors in question simply as obstructions. It was the hope of Mulmuley and
Sohoni that the search could be further restricted to a particular kind of obstructions,
but we already observed in 2015 that this appears unlikely we present these results
in Chapter 4. Quite recently in 2016 it was shown by Bürgisser, Ikenmeyer, and
Panova that indeed these so-called occurrence obstructions do not suffice to prove
Valiant’s Conjecture, representing a bitter setback for the programme.
For the second part of the thesis, we treat a central topic of GCT in more detail and
generality: One can measure the approximate complexity of polynomials by studying
the closure of their orbit under the action of a general linear group by precomposi-
tion. The smallest dfor which a polynomial appears in the orbit closure of the d×d
determinant polynomial is equivalent to Valiant’s original measure for its complexity,
if approximations are permitted. The geometry of the determinant orbit closure is
little understood even for small values of d, but we can give a classification of the
components that appear in the boundary for d=3.
The required toolbox of geometric and algebraic techniques is introduced in the
preliminary Chapter 5. Quite generally, the orbit closure of a homogeneous form is an
algebraic variety with a GLn-action which is both intuitive and yet incredibly intrigu-
ing, and is an object of study to the beautiful fields of classical geometric invariant
theory and birational geometry. In the subsequent chapters we study this problem for
other homogeneous forms than the determinant.
Chapter 6deals with a polynomial that has appeared in the context of GCT before,
namely the universal monomial x1···xd. Its orbit closure is contained in the orbit
closure of detdand was instrumental in proving the results of Chapter 4, for example.
It also has the remarkable and rare property that every polynomial in its orbit closure
is the result of a variable substitution. A classification of all polynomials with this
property remains open, but we both answer and pose several questions to advance it.
In Chapter 7, we introduce additional techniques used in the two subsequent chap-
ters: We bound the number of irreducible components of the orbit closure boundary
by the number of smooth blowups required to resolve the indeterminacy of a related
rational map.
The first application of this technique yields a classification of the boundary of det3
in Chapter 8. Here, we also describe the stabilizer group of the determinant of a
generic traceless matrix and conclude that the orbit closure of this polynomial is
always a codimension one component of the boundary of the orbit closure of the
determinant.
The final chapter contains unpublished work on the universal sum of two mono-
mials, the binomial x1···xd+y1···yd. Like the monomial, the binomial is contained
in the orbit closure of detd. A firm understanding of this polynomial should therefore
4
precede the study of general determinantal expressions. The binomial already gives
rise to a significantly more involved geometry than the monomial. We can describe
two components of the boundary in detail, but must leave a subtle question open. If
said question could be answered affirmatively however, these two components con-
stitute the entire boundary of the orbit closure of the binomial.
5
Part I
Geometric Complexity Theory
7
Chapter 1
Algebraic Complexity Theory
The following is only the briefest of summaries: Complexity theory analyzes the
complexity of solving problems. Problems have instances of different sizes which can
be solved by algorithms. Denoting by t(m)the minimum number of steps performed
by any algorithm that solves all instances of size m, the complexity of a problem is
the function t:NN. The definitions of “problem”,“algorithm” and related notions
is encompassed by a model of computation. The classical model of computation is
the Turing machine, which mimics our present-day computers. Liberally quoting the
famous Church-Turing thesis, any problem that we consider computationally solvable
is solvable by a Turing machine. This generality comes at a price, however: The Turing
model provides very little mathematical structure to be exploited.
In this chapter, we explore the algebraic model, where a problem is given as a
family of polynomial functions and the goal is quite simply to evaluate them.
1.1 Arithmetic Circuits
×
+
xy
Figure 1.1.1: x2+xy
The algorithms in algebraic complexity theory are arith-
metic circuits. An arithmetic circuit is a schematic repre-
sentation of a way to compute a polynomial: Figure 1.1.1
shows a circuit computing x2+xy Z[x,y]. In general, an
arithmetic circuit is a directed, acyclic graph where at every
vertex, a polynomial is being computed. Vertices with no
ingoing edges contain constants or variables and vertices
with exactly two ingoing edges are either labeled with the
symbol + (Plus) or × (Times), computing the sum or
the product of their input, respectively. This concept can be generalized to the notion
of a circuit, which performs computation in an algebraic structure by associating to
any vertex with kingoing edges some k-ary operation.
9
1.1.1 Definition. Let Rbe a commutative ring and R[x]the polynomial ring over R
in a countably infinite set of variables x. An arithmetic circuit over Ris a directed
acyclic graph Cwith vertex labels, subject to the following conditions:
(1) The vertices with no incoming edges are labelled with elements of Rx. Any
such vertex is called an input gate.
(2) Since Cis acyclic, every vertex of Chas a well-defined depth, which is the length
of a longest path from an input gate to it. Any vertex of positive depth has exactly
two incoming edges and is labelled with an element of {+,×}. Any such vertex
is also called a computation gate.
We define P(v)R[x]for every vertex vrecursively by depth, as follows: If vis an
input gate, P(v)is defined to be the label of v. Otherwise, let v{+,×}be the label
of vand denote by uand wthe source vertices of the two incoming edges of v. Then,
we can define P(v):=P(u)vP(w).
The circuit Ccomputes an element PR[x]if there is a vertex vwith P=P(v).
The size of Cis the number of computation gates, denoted by|C|.
1.1.2 Example. In Figure 1.1.2, we give an example of an arithmetic circuit over C
which computes the equation of an affine elliptic curve in the variables{x,y}.
+
+
×
+
×
××
1xy
Figure 1.1.2: An arithmetic circuit computing y2+xy x31.
1.1.3 Definition. Let Rbe a commutative ring. The (circuit) complexity of a poly-
nomial Pwith coefficients in Ris the minimum size of an arithmetic circuit over R
which computes P. We denote this number by ccR(P).
10
It should be emphasized that complexity theory does not study the complexity of
single polynomials. Instead, the object of study are families of polynomials. This is
the main reason why it is convenient to work with infinitely many variables.
1.1.4 Example. Consider the polynomial Pm:=x2
1+···+x2
mR[x]for mN. We
study the map NNgiven by m ccR(Pm). We claim that 1
2mccR(Pm)2m,
which is interpreted by saying that the complexity of computing a Euclidean norm is
linear in the input size. It is easy to see that the circuit
+
···
++
× × × ··· ×
x1x2x3··· xm
computes Pmwith 2m1 computation gates. Since Pmis supported on mvariables,
each of which needs to be connected to some computation gate in a circuit comput-
ing Pm, we can see that ccR(Pm)1
2mbecause at most two variables can be connected
to the same gate. With a more refined argument, one can actually show ccR(Pm)m
in an even stronger model of computation [BCS97, Example 6.1].
1.1.5 Notation. We fix a commutative ring Rand a countably infinite set xof variables.
A polynomial will always be an element of R[x]unless stated explicitly otherwise. We
will also write PR[x1, . . . , xn]for a polynomial in nvariables and implicitly identify
the xiwith certain elements of x.
The elements of xare not sequentially numbered or named in any way, a priori.
We do not assume any ordering on x. We may often write x1, . . . , xrx, but we
may also write a,b,cxor yx.
This way, we always operate in the ring R[x], regardless of what (finite) number of
variables we require. Finally, we write cc(P)instead of ccR(P)unless we want to put
emphasis on the ring R.
Remark. In Chapter 2,Rwill be equal to Zand starting with Chapter 3,Rwill be the
field of complex numbers.
11
1.2 The Classes VP and P
Only for very few and rather simple families (Pm)mNof polynomials, we can deter-
mine the function m cc(Pm)“explicitly”. As is often the case in complexity theory,
we restrict instead to the classification of such functions by their asymptotic rate of
growth. Even this task turns out to be quite challenging.
1.2.1 Definition. A function t:NNis called polynomially bounded if there is a
polynomial pZ[x]and m0Nsuch that mm0:t(m)p(m). We define poly
as the set of all polynomially bounded functions NN.
Remark. We will often write t(m)poly(m)instead of tpoly, for example we
would write m3poly(m)instead of giving the function m m3a name. If a
numeric quantity t(m)Nis associated to each element of a family P= (Pm)mN,
we say that Phas (or admits) polynomially many of said quantity to express that
tpoly.
It turns out that it is not desirable to study arbitrary families of polynomials.
For example, the value x2mfor xZcannot be computed by a Turing machine in
polynomial time simply because the output is too large, but the circuit
××
···
××
x
computes it with only mcomputation gates.
1.2.2 Definition. We say that Pis a p-family if P= (Pm)mNis a family of polyno-
mials with deg(Pm)poly(m). The complexity class VP is defined to be the set of all
p-families Pwith cc(Pm)poly(m).
Remark. Although it is not visible in the notation, this definition depends on the coef-
ficient ring R.
VP is the circuit analogue of the complexity class P. The “V” in its name stands for
the name of its inventor Valiant, who introduced these notions in [Val79a].
1.2.1 The Class P
Classical complexity theory deals with families of boolean functions Fm
2F2. A
family Bm:Fm
2F2is in the complexity class Piff there exists a Turing machine
which computes Bmand requires polynomially many steps to do so. We do not give
the formal definition of a Turing machine, but it may be thought of as a computer
algorithm, where a step is a physically atomic operation for the machine.
12
The class Phas the nonuniform analogue P/ poly which has a description in terms
of circuits: Nonuniformity means that we are not asking for a single algorithm to
work on all input sizes, but we allow different algorithms for different input sizes. In
the language of circuits, a family Bm:Fm
2F2of boolean functions is in P/ poly if
and only if ccF2(Bm)poly(m). Note that by polynomial interpolation, any function
Fm
2F2is given by some polynomial in F2[x1, . . . , xm]F2[x].
1.3 Reduction and Completeness
An important concept in boolean as well as algebraic complexity theory is reduction.
Informally put, a problem Qcan be reduced to a problem Pif one can produce an
algorithm for Qfrom an algorithm for Pwithout measurably increasing the runtime.
This is only a vague description, of course. We will give the precise definition for the
algebraic model after introducing some notation.
1.3.1 Definition. Denote by N(x)the x-indexed sequences α= (αx)xxof natural
numbers with αx=0 for only finitely many xx. By definition, a polynomial
PR[x]is a map P:N(x)Rwhich we denote α Pαand require Pα=0 for only
finitely many αN(x). One then writes
P=
αN(x)
Pα·
xx
xαx.
We define the support of Pto be the set of variables that occur in this expression, i.e.
supp(P):={xxαN(x):αx=0Pα=0}.
Note that supp(P)is always a finite set. For PR[x]with supp(P) = {x1, . . . , xn},
the expression for Pbecomes the familiar P=αNnPα·n
i=1xαi
i.
1.3.2 Example. Let x,y,zx. For Q=x2y+z3, we have supp(Q) = {x,y,z}. The
polynomial P=x2y+y3can also be viewed as an element of C[x,y,z]but we have
supp(P) ={x,y}because zdoes not occur in P.
1.3.3 Remark. If PVP, then |supp(Pm)|poly(m). In other words, any family
in VP requires only polynomially many variables. Indeed, in a circuit of minimum
size computing Pm, every input gate containing a variable is connected to at least one
computation gate and each computation gate can be connected to at most two input
gates. Since there are only polynomially many computation gates, there can only be
polynomially many variables in a (nonredundant) expression for Pm.
13
We next define a partial order on R[x]where PQholds if Pcan be ob-
tained from Qby variable substitution. For example, PQholds in Example 1.3.2
because Parises from Qby substituting zfor y. This is formalized in the following
Definition 1.3.4:
1.3.4 Definition. For Sxand a map σ:SR[x], we extend σto a map xR[x]
by the identity. We then denote by Pσthe image of Punder the unique R-algebra
homomorphism R[x]R[x]which maps x σ(x)for all xx.
A polynomial Pis called a projection of another polynomial Qif there is some
Ssupp(Q)and a map σ:SxRsuch that P=Qσ. We denote this by PRQ.
The definition extends to families as follows: A p-family Pis a p-projection of another
p-family Q, in symbols PRQ, if there is some tpoly such that PmQt(m)for all
mN. We will write PQin both cases if there is no ambiguity concerning R.
1.3.5 Example. Let x,y,zxand Q:=y2z+xyz x3z3. Let σ:xR[x]be the
identity everywhere except for σ(z) = 1. Then, one obtains Qσ=y2+xy x31.
1.3.6 Example. Let aixand xixbe distinct variables for all iN. For dN, we
consider the family Q= (Qd)dNgiven by
Qd:=a0+a1x1+. . . +adxd
with supp(Qd) ={a0, . . . , ad,x1, . . . , xd}. Any p-family PVP of affine linear polyno-
mials satisfies PQ. Indeed, this follows because t(m):=|supp(Pm)|is polynomially
bounded by Remark 1.3.3.
1.4 The Classes VNP and NP
Valiant introduced the class VNP as an analogue of the class NP, but we will give a
definition of the class VNP first because our focus is on the algebraic model. We will
then discuss the relation to NP.
1.4.1 Definition. Let Pbe a p-family. Then PVNP if and only if there exists a family
QVP and a sequence (Cm)mNwith Cmsupp(Qm)such that Pis a projection of
the family ˜
Q= ( ˜
Qm)mN, which is defined as
˜
Qm:=
σ:Cm{0,1}
Qσ
m·
xCm
σ(x)=1
x.
Remark. We use here the original definition from [Val79a, p. 252] because we can later
make the analogy to classical complexity classes more clear. In subsequent literature,
the definition appears in seemingly weaker, but equivalent form [Bü00, p. 5].
14
Remark. It follows from the definition that VP VNP: For PVP, choose Q=Pand
Cm:=for all mN. By Definition 1.3.1, the empty map σ:{0,1}satisfies
Pσ
m=Pmfor all mN.
Remark. Note that for PVNP, we again have |supp(Pm)|poly(m). Indeed, in
Definition 1.4.1 we know that |supp(Qm)|poly(m)by Remark 1.3.3. Furthermore,
supp(˜
Qm)supp(Qm)and and since Pm˜
Qm, we have
|supp(Pm)|supp(˜
Qm)|supp(Qm)|poly(m).
1.4.1 The Class NP
We will now explain and motivate this definition by comparing Valiant’s class with
the classical one. A family Bm:Fm
2F2is in NP if and only if there are functions
tmpoly(m)and Cm:Fm
2×Ftm
2F2such that
the family (Cm)mNis in Pand
mN:σFm
2:(Bm(σ) = 1)(cFtm
2:Cm(σ,c) = 1).
In other words, it might not be easy to decide whether Bm(σ) = 1, but it is easy to
confirm that Bm(σ) = 1 if given a valid certificate cFtm
2.
Again, there is a relevant related complexity class known as #P. It contains families
of maps Fm
2N. Such a family Pm:Fm
2Nis in #Pif and only if there exists a
family (Bm)mNNP which has a certificate function (Qm)mNPsuch that Pm
counts the number of certificates, i.e.,
Pm(σ) ={cFtm
2Qm(σ,c) = 1}.
Note that Bm(σ) = 1 if and only if Pm(σ)>0.
If we interpret QmF2[x,y]where x={x1, . . . , xm}and y={y1, . . . , ytm}, then
˜
Qm:=
σ:y{0,1}
Qσ
m·
tm
k=1
σ(yk)=1
yk
is an element of VNP over F2by Definition 1.4.1 and for any σFm
2=Fx
2, i.e.,
viewing σas a map σ:x{0,1}=F2, we have ˜
Qσ
m=0 if and only if Bm(σ) = 0.
More precisely, ˜
Qσ
mhas exactly Pm(σ)monomials and the monomial
tm
k=1
ck=1yk
occurs in ˜
Qσ
mif and only if cFtm
2is a certificate with Qm(σ,c) = 1.
15
1.4.2 Completeness of the Permanent
1.4.2 Definition. Let Cbe some class of p-families, like VP or VNP. A p-family Pis
called C-complete if P C and Q C:QP.
Two completeness results by Valiant make the connection between #Pand VNP
even more tangible. They both concern the permanent polynomial family. The per-
manent of an m×mmatrix (xij)1i,jmis defined as
perm:=
πSm
m
i=1
xi,π(i), (1)
where Smdenotes the symmetric group on msymbols, i.e., the group of all set auto-
morphisms of [m]:={1, . . . , m}. We will consider per = (perm)mNas a family of
polynomials in the variables xij.
1.4.3 Theorem ([Val79b]). The problem of computing the permanent of a binary ma-
trix (one where every entry is either 0 or 1) is #P-complete.
1.4.4 Theorem ([Val79a]). The permanent family is VNP-complete over any field of
characteristic different from 2.
By a formula of Ryser [Rys63], we know that permcan be computed by a circuit
of size m·2m, but no significant improvement over this is known. This leads to the
following conjecture:
1.4.5 Conjecture (Valiant’s Hypothesis). The inclusion VP VNP is strict.
Given any VP-complete family P, one could prove VP =VNP by showing that the
permanent is not a p-projection of P.
1.5 Determinant Versus Permanent
The definition of the permanent (1) is quite similar to the familiar definition of the
determinant family
detd:=
πSd
sgn(π)·
d
i=1
xi,π(i).
It is known [MV97] that detdcan be computed by a circuit of size O(d6) this is not
immediately obvious because the classical procedure of Gaussian elimination per-
forms divisions: Arithmetic circuits only allow for multiplication and addition.
Unfortunately, the determinant is not known to be VP-complete, nor is it thought
to be. However, it is complete for a subclass of VP which we will study next.
16
1.5.1 Classes for the Determinant
The class of polynomials for which the determinant family is complete can be defined
in several different ways. We will not go into detail and instead refer to [MP08;Tod92].
However, we give one definition:
An arithmetic circuit Cis weakly skew if every vertex with label × has one
incoming edge which is a bridge, i.e., removing this edge increases the number of
connected components of C.
1.5.1 Example. Recall the circuit computing y2+xy x31 from Example 1.1.2. It
is not weakly skew, the condition is for example violated at the topmost multiplica-
tion gate computing xy x31. However, the circuit can be made weakly skew by
duplicating some of the input gates: See Figure 1.5.1. At each multiplication gate, the
input edge which is a bridge has been highlighted.
+
+
×
+
×
××
1
x x x y y
Figure 1.5.1: A weakly skew arithmetic circuit computing y2+xy x31.
Denote by ccws (P)the minimum number dsuch that Pcan be computed by a weakly
skew circuit of size d.VPws consists of all p-families Psuch that ccws (Pm)poly(m).
1.5.2 Theorem ([Tod92],[MP08, Lemma 6]). The determinant family is VPws-complete
and for any PR[x]with d:=ccws (P), we have Pdetd+1.
Clearly, VPws VP and therefore, if Valiant’s Hypothesis (Conjecture 1.4.5) holds,
the inclusion VPws VNP must also be strict. Since the determinant is VPws-complete
and the permanent is VNP-complete by Theorem 1.4.4,VPws =VNP is equivalent to
the following:
17
1.5.3 Conjecture (Permanent vs. Determinant). The permanent polynomial family is
not a p-projection of the determinant polynomial family.
1.5.2 Determinantal Complexity
In order to approach Permanent versus Determinant (Conjecture 1.5.3), we need to
study p-projections of the determinant. We will show that the determinant is uni-
versal, meaning that every polynomial is a projection of detdfor some dN. This
important insight gives rise to the definition of determinantal complexity: For a poly-
nomial Pwe define
dc(P):=min{dN|Pdetd}.
This allows us to reformulate Conjecture 1.5.3 simply as dc(perm)/poly(m). The
benefit of this reformulation is the fact that there are no more circuits involved in it
and methods from algebra directly apply.
1.5.4 Theorem ([Val79a, §2]). Let Rbe a commutative ring and PR[x1, . . . , xn]a
polynomial. Then, there is a natural number dNsuch that Pis a projection of detd.
Remark. This highlights the importance of the determinant in a very fundamental
way. Quoting Valiant himself, “for the problem of finding a subexponential formula for a
polynomial when one exists, linear algebra is essentially the only technique in the sense that it
is always applicable.”
Proof. The proof is based on a graph construction which we will sketch here: Given
a matrix a= (aij)1i,jmwe can consider the labelled directed graph Gaon the set
of vertices {1, . . . , m}where the edge (i,j)has label aij. We treat edges with label 0
as nonexistent. For a directed graph with labels in R[x1, . . . , xn], we can reverse this
process and obtain a matrix from Gwhich we will refer to as its adjacency matrix. A
cycle cover of Gais a partition of Gainto vertex disjoint cycles. The permutations on the
set{1, . . . , m}are in bijection with the cycle covers of Ga. To each cycle we associate a
sign, which is 1 if the cycle has even length and 1 otherwise. To each cycle we then
associate a weight, which is its sign times the product of its edge labels. The weight
of a cycle cover is the product of the weights of its cycles. The determinant of ais
then, by definition, the sum of the weights of all cycle covers of Ga.
Given a polynomial PR[x1, . . . , xn], let k:=deg(P)be its (total) degree. A
product of k+1 constants and variables can produce any monomial that occurs in
P, so we can write P=r
i=1k
j=0Pij for some rNand Pij R{x1, . . . , xn}. For
example, consider the polynomial
P=x3+1y2xy = (x·x·x) + (1·1·1) + (1·y·y) + (1·x·y).
18
For this example, we used only 3 factors in each summand as opposed to the 4 factors
that the construction would yield for a general cubic polynomial.
t
x
1
y
y
x
1
y
x
s
x
1
1
1
We construct a graph Has follows: For each
of the rsummands in the above representation
of P, consider a path with k+1 edges where the
edges of path iare labeled with the values pij
for 0 jk.
We identify the start vertices of all these
paths and call it s, then we identify the end ver-
tices of all these paths and call it t. We obtain an acyclic graph with rpaths going
from sto t.
v
2
1
y
1y
1
1
4 1
y
3
x
1
1
5
1
x
6x
1
x
7
1
1
8
1
1
1
Figure 1.5.2: The Graph G
Let then Gbe the graph that arises from Hby
first adding loops with label 1 to all vertices ex-
cept sand tand then identifying sand tinto a
single vertex v.
By construction, every cycle in Gwhich is
not a loop contains the vertex vand all these cy-
cles have the same length. Therefore, the cycle
covers of Gare in bijection with the summands
k
j=0pij via their weight, up to a common sign.
It follows that the adjacency matrix aof Gsat-
isfies det(a) = ±P. If det(a) = P, we can
achieve det(a) = Pby considering the block ma-
trix (a0
01)instead of a.
Remark. Treating the vertex vin our example as the 9-th vertex and numbering all
other vertices as in Figure 1.5.2, one can check that indeed
det
1y0000000
0 1 0 0 0 0 0 0 y
001x0 0 0 0 0
0 0 0 1 0 0 0 0 y
0 0 0 0 1 0 0 0 x
0 0 0 0 x1 0 0 0
000000101
000000110
1 0 100x010
=x3+1y2xy.
However, this procedure does not yield the optimal result. For example,
det (1 0 x0
0 1 0 y
0 0 1 x
x y y 1)=x3+1y2xy.
We used computational methods akin to the algorithm from the next chapter to com-
pute this representation.
19
Chapter 2
Binary Determinantal Complexity
This chapter contains the contents of the previously published work [HI16]. We only
consider polynomials with integer coefficients here, i.e., we assume R=Z.
Proving Valiant’s Hypothesis (Conjecture 1.4.5) amounts to bounding the growth
of dc(perm)superpolynomially from below. Lower bounds are a notoriously difficult
problem in complexity theory. On the other hand, finding upper bounds admits the
straightforward approach of constructing algorithms: In our case, an algorithm can
be described as a sequence of matrices Am(xZ)tm×tmsuch that det(Am) = perm.
Here, the numbers tmdc(perm)achieve equality if and only if the algorithm is
optimal. To get a better idea of how dc(perm)grows, it is therefore reasonable to
attempt the construction of good algorithms. The best one known so far is a graph
construction by Grenet [Gre11], see Section 2.5, with the following consequence:
2.0.1 Theorem. For every natural number mthere exists a matrix Aof size 2m1
such that perm=det(A). Moreover, Acan be chosen such that the entries in Aare
only variables, zeros, and ones, but no other constants.
Theorem 2.0.1 gives rise to the following definition: We call a matrix whose entries
are only zeros, ones, or variables, a binary variable matrix. We will prove in Corol-
lary 2.1.3 that every polynomial Pwith integer coefficients can be written as the de-
terminant of a binary variable matrix and that the size is almost equal to dc(P), see
Proposition 2.1.2 for a precise statement. We then denote by bdc(P)the smallest d
such that Pcan be written as a determinant of an d×dbinary variable matrix. It is
called the binary determinantal complexity of P.
The complexity class of families (Pm)mNwith polynomially bounded binary de-
terminantal complexity bdc(Pm)is exactly VP0
ws, the constant free version of VPws, see
Section 2.4 for definitions and proofs.
Theorem 2.0.1 shows that bdc(perm)2m1. This upper bound is clearly sharp
for m=1 and for m=2, and we can also verify that it is sharp for m=3:
2.0.2 Theorem. bdc(per3) = 7.
21
We use a computer aided proof and enumeration of bipartite graphs in our study.
The binary determinantal complexity of permis now known to be exactly 2m1 for
m{1,2,3}. Unfortunately, determining bdc(per4)is currently out of reach with our
methods.
The best known general lower bound is bdc(perm)dc(perm)m2
2due to
[MR04] in a stronger model of computation, see also [LMR13] for the same bound
in an even stronger model of computation. After the proof of Theorem 2.0.2 was
published, Alper, Bogart, and Velasco proved in [ABV15] that dc(per3) = 7, but un-
fortunately per4remains out of reach even with their methods.
2.1 The Cost of Computing Integers
The main purpose of this section is to prove that even though we only allow the
constants 0 and 1, all polynomials with integer coefficients can be obtained as the
determinant of a binary variable matrix, see Corollary 2.1.3. Moreover, the size of
the matrices is not much larger than had we allowed integer constants, see Propo-
sition 2.1.2. We use standard techniques from algebraic complexity theory, heavily
based on [Val79a], but a certain attention to the signs has to be taken.
In what follows, a digraph is always a finite directed graph which may possibly
have loops, but which has no parallel edges. We label the edges of a digraph by
polynomials. We will almost exclusively be concerned with digraphs whose labels
are only variables or the constant 1. Note that we consider only labeled digraphs.
A cycle cover of a digraph Gis a set of cycles in Gsuch that each vertex of G
is contained in exactly one of these cycles. If a cycle in Ghas iedges with labels
e1, . . . , ei, then its weight is defined as (1)i1·e1···ei. The weight of a cycle cover is
the product of the weights of its cycles. The value of Gis the polynomial that arises as
the sum over the weights of all cycle covers in G. We then define the directed adjacency
matrix A of a digraph G as the matrix whose entry Aij is the label of the edge (i,j)or
0 if that edge does not exist.
In what follows, we will often construct matrices as the directed adjacency matrices
of digraphs. The reason is the well-known observation that the value of a digraph G
equals the determinant of its directed adjacency matrix see for example [Val79a].
As an intermediate step, we will often construct a binary algebraic branching pro-
gram: This is an acyclic digraph Γ= (Γ,s,t)where every edge is labeled by either 1 or
a variable. The digraph Γhas two distinguished vertices, the source s and the target t,
where shas no incoming and thas no outgoing edges. If an s-t-path in Γhas iedges
with labels e1, . . . , ei, then its path weight is defined as the value (1)i1·e1···ei.
The path value of Γis the polynomial that arises as the sum over the path weights of
22
all s-t-paths in Γ. We remark that this notion of weight differs from the literature by a
sign.
2.1.1 Proposition. For a nonzero constant cZ, there is a binary algebraic branching
program Γwith at most O(log |c|)vertices whose path value is c.
Proof. We can assume without loss of generality that c>0: Given a binary algebraic
branching program Γwith path value c>0 and at most O(log c)vertices, we can
add a single vertex tand an edge from tto twith label 1 to obtain a new program
(Γ,s,t)with path value c.
For a natural number c, an addition chain of length is a sequence of distinct natural
numbers 1 =c0,c1, . . . , c=ctogether with a sequence of tuples (j1,k1), . . . , (j,k)
such that ci=cji+ckiand ji,ki<ifor all 1 i. However, we will think of
this data as a digraph ˜
Γon the vertices {v0, . . . , v}with edges (vji,vi)and (vki,vi)
for all 1 i. The labels of all edges are equal to 1. Note that we allow double
edges in these digraphs temporarily. We set s:=v0and t:=v. Thus, we view an
addition chain as an acyclic digraph where every vertex except for v0has indegree
two. This already strongly resembles a binary algebraic branching program, but ˜
Γ
might have parallel edges. Observe that there are exactly cimany paths from v0to vi
in the digraph ˜
Γ. In particular, there are exactly cpaths from sto tin ˜
Γ.
Using the algorithm of repeated squaring [Knu98, Sec. 4.6.3, eq. (10)] one can
construct an addition chain ˜
Γas above with at most O(log c)vertices and such that
there are exactly cpaths from sto tin ˜
Γ. For every edge (v,w)in ˜
Γwe add a new
vertex uand replace the edge (v,w)by two new edges (v,u)and (u,w). We call the
resulting digraph Γ= (Γ,s,t). Observe that the binary algebraic branching program Γ
has no parallel edges any more and all s-t-paths in Γhave even length. Also, the
digraph Γstill has O(log c)many vertices. Labelling all edges in Γwith 1, the path
value of Γis equal to c.
2.1.2 Proposition. Let Cbe a d×dmatrix whose entries are variables and arbitrary
integer entries. Let cmax be the integer entry of Cwith the largest absolute value. Then
there is a binary variable matrix Aof size O(d2·log |cmax|)with det(A) = det(C).
Proof. We will interpret Cas the directed adjacency matrix of a digraph. Any edge
that has an integer label which is neither 1 nor 0 will be replaced by a subgraph
of size O(log |cmax|)arising from the construction of the previous Proposition 2.1.1.
The directed adjacency matrix of the resulting graph will be the desired matrix A.
Formally, we proceed by induction.
Denote by kthe number of integer entries in the matrix Cthat are neither equal
to 0 nor 1. By induction on k, we will prove the slightly stronger statement that there
23
x3y
2
x
x3y
2
x
x3y
x
x3y
x
x3y
x
C=
3 0 2
0x0
x0y
det(C) = 3xy 2x2
Figure 2.1.1: Given a matrix Cwe construct a digraph Hwith directed adjacency matrix C(left hand
side) and the digraph G(right hand side) by replacing the edge with label 2 in Hby a binary algebraic
branching program. We omit the labels for edges that have label 1. The right hand side depicts the
cycle covers Kof Gand the left hand side shows the corresponding cycle covers KHof H.
is a binary variable matrix Aof size d+k·O(log |cmax|)with det(A) = det(C). Since
kd2, this implies the statement. Note that the case k=0 is trivial, so we assume
k1 and perform the induction step.
Let Hbe the digraph whose directed adjacency matrix is C. Recall that this means
the following: His a digraph on the vertices 1, . . ., dand there is an edge (i,j)with
label Cij if Cij =0 and otherwise no such edge exists. Let e= (i,j)be the edge
corresponding to an integer entry c=Cij which is neither 0 nor 1. Let Γ= (Γ,s,t)be
a binary algebraic branching program with path value cand O(log |c|)many vertices,
which exists by Proposition 2.1.1.
We will now replace the edge (i,j)by Γ(see Figure 2.1.1): Let Gbe the digraph that
arises from HΓby removing the edge (i,j), adding edges (i,s)and (t,j)with label
1 and adding loops with label 1 to all vertices of Γ. The directed adjacency matrix of
Ghas size d+O(log |c|)d+O(log |cmax|)and contains k1 integer entries which
are neither 0 nor 1. By applying the induction hypothesis to the directed adjacency
matrix of G, we obtain a matrix Aof size
d+O(log |cmax|) + (k1)·O(log |cmax|) = d+k·O(log |cmax|)
24
whose determinant equals the value of G. We are left to show that the value of Gis
equal to det(C), i.e., the value of H.
For this purpose, we will analyze the relation between cycle covers of Gand H,
which is straightforward (see Figure 2.1.1): Consider a cycle cover Kof G. Any vertex
of Γwhich is not covered by its loop must be part of a cycle whose intersection with Γ
is a path from sto t. To Kwe can therefore associate a cycle cover KHof Has follows:
If every vertex of Γis covered by its loop in K, let KHbe Kwithout these loops.
Otherwise, there is a unique cycle κKin Kthat restricts to an s-t-path πKin Γ. Let κH
K
be the intersection κKHtogether with the edge (i,j)and note that κH
Kis a cycle in H.
We obtain KHfrom Kby replacing κKwith κH
Kand removing all remaining loops from
inside Γ.
All cycle covers Lof Hare of the form L=KHfor some cycle cover Kof G. If Lis
a cycle cover of Hcontaining the edge (i,j)then the cycle covers Kof Gwith L=KH
are in bijection with the s-t-paths in Γ. We now fix such a cycle cover L. By definition
of the value of a digraph, it suffices to show that
Kcycle cover of G
such that L=KH
wt(K) = wt(L).
Note that Kand L=KHdiffer only in loops and in the cycles κKand κH
K, respectively.
Since loops contribute a factor of 1 to the weight of a cycle cover, we are left to prove
that
Kcycle cover of G
such that L=KH
wt(κK) = wt(κH
K).
Let e1, . . . , erbe the labels of the edges of κKH. These are the edges shared by κK
and κH
K. Thus,
wt(κH
K) = (1)r·c·e1···er=
πis s-t-path
inside P
wt(π)
·(1)r·e1···er
=
Kcycle cover of G
such that L=KH
wt(πK)
·(1)r·e1···er
=
Kcycle cover of G
such that L=KH
wt(πK)·(1)r·e1···er
=
Kcycle cover of G
such that L=KH
wt(κK)
is precisely the desired equality.
25
2.1.3 Corollary. For every polynomial PZ[x]there exists a binary variable matrix
whose determinant is P.
Proof. Combine Theorem 1.5.4 and Proposition 2.1.2.
2.2 Lower Bounds
This section is dedicated to the proof of Theorem 2.0.2. Let B:={0,1}. A sequential
numbering makes the proof much easier to read, so we think of the variables as
arranged in a 3 ×3 matrix
x=
x1x2x3
x4x5x6
x7x8x9
.
In this section, we will understand per3=per(x)as a polynomial in the variables
x1, . . . , x9instead of the variables xij with 1 i,j3.
2.2.1 Proof Outline
Let dNand Aan d×dbinary variable matrix. The binary matrix B(A)Bd×d
is defined as the matrix arising from Aby setting all variables to 1. We call B(A)the
support matrix of A. If we set all variables to 1 in per3, we obtain the value 6, so if
per3=det(A), then substituting 1 for all variables on both sides of the equation, we
obtain the condition
6=det(B(A)). (1)
In [EZ62;Slo11], the maximal values of determinants of binary matrices are computed
for small values of d. Since
BB5×5: det(B)5, (2)
we immediately obtain the lower bound bdc(per3)6.
Unfortunately, there are several matrices BB6×6that satisfy det(B) = 6. We
proceed in two steps to verify that nevertheless, none of these matrices Bis the support
matrix B(A)of a candidate matrix Awith per3=det(A). A rough outline is the
following:
(a) Enumerate all matrices BB6×6with det(B) = 6 up to symmetries.
(b) For all those matrices Bprove that Bis not the support matrix B(A)of a binary
variable matrix Awith det(A) = per3. We describe this process in the next
subsection.
26
2.2.2 Stepwise Reconstruction
Let us make (b) precise. In the hope of failing, we attempt to reconstruct a binary
variable matrix Athat has support Band which also satisfies det(A) = per3. During
the reconstruction process, we successively replace 1’s in Bby the next variable. The
process is as follows:
Given a binary matrix BB6×6, let
S:={(i,j)Bij =1}
be the set of possible variable positions. For any set of positions IS, we consider
the matrix BIthat arises from Bby placing a variable yin every position in I. If Bis
the support of a binary variable matrix Awith det(A) = per3and Icontains exactly
the positions where y:=x1occurs in A, then det(BI)must be equal to
per3
y1 1
1 1 1
1 1 1
=2y+4. (3)
We define the set
S:={IS|det(BI) = 2y+4}.
2.2.1 Lemma. Let Abe a binary variable matrix with support Band det(A) = per3.
Let k{1, . . . , 9}and define Ik:={(i,j)Aij =xk}as the set of positions where the
variable xkoccurs in A. Then, we have Ik S.
Proof. By the symmetry of the permanent, we may assume that k=1. In the matrix A,
setting every variable except y:=x1to 1 yields the matrix BIand therefore, det(BI) =
2y+4 as in (3), because det(A) = per3. This means Ik S by definition.
Therefore, if Bis the support matrix B(A)of a binary variable matrix Awith
det(A) = per3, we can find 9 pairwise disjoint sets in S, one for each variable xk,
that specify precisely where to place these variables in A. By a recursive search and
backtracking, we now look for sets I1, . . . , Ik S such that
iI1, . . . , Ikare pairwise disjoint.
ii Placing xiinto Bat every position from Iifor 1 ikyields a matrix Aksuch
that det(Ak)is equal to per3(x1, . . . , xk, 1, . . . , 1).
The search is recursive in the following sense: First, the possible choices at depth k=1
are given by S. Enumerating the possible choices for depth k+1 works as follows: For
each choice I1, . . . , Ik S with the above two properties, we enumerate all Ik+1 S
that have empty intersection with I1··· Ikand check whether condition (ii) is
satisfied.
27
If the recursive search never reaches k=9 or fails there, then Bis not the support
of a binary variable matrix Awith det(A) = per3. If we reach level 9 however and do
not fail there, we have found such an A.
In practice, the evaluation of det(A)is sped up significantly by working over a
large finite field Fpand choosing random elements x1, . . . , x9Fp\{0,1}.
2.2.3 Exploiting Symmetries in Enumeration
Let us call two matrices equivalent if they arise from each other by transposition and/or
permutation of rows and/or columns. A key observation is that equivalent matrices
have the same determinant up to sign. Therefore we do not have to list all binary
matrices BB6×6with det(B) = 6, but it suffices to list one representative matrix
Bwith det(B) = ±6 for each equivalence class. It happens to be the case that the
equivalence classes of 6×6 binary matrices are in bijection to graph isomorphy classes
of undirected bipartite graphs G= (VW,E)with |V|=|W|=6, VW=as
follows: For V={v1, . . . , v6}and W={w1, . . . , w6}, the bipartite adjacency matrix
B(G)B6×6of Gis defined via B(G)i,j=1 if and only if {vi,wj}E. Row and
column permutations in B(G)are reflected by renaming vertices in G. Transposition
of B(G)amounts to switching Vand Win G.
The computer software nauty [MP13] can enumerate all 251610 of these bipar-
tite graphs, which is already a significant improvement over the 236 =68719476736
elements of B6×6. To further limit the number of bipartite graphs that have to be
considered, we make the following observations:
We need not consider binary matrices Bcontaining a row iwith only a single entry
Bij equal to 1. Indeed, Laplace expansion over the i-th row yields that det(B)is
equal to the determinant of a 5 ×5 binary matrix, which can at most be 5, see (2).
Translating to bipartite graphs, we only need to consider those bipartite graphs
where all vertices have at least two neighbours.
If two distinct vertices in Ghave the same neighbourhood, then the bipartite adja-
cency matrix B(G)has two identical rows (or columns) which would imply
det(B(G)) = 0. Hence, we only need to enumerate bipartite graphs where all ver-
tices have distinct neighbourhoods. Unfortunately nauty can impose this restriction
only on rows and not on columns.
With these restrictions, the nauty command
genbg -d2:2 -z 6 6
generates 44384 bipartite graphs, only 263 of which have a bipartite adjacency matrix
with determinant equal to ±6. We then preprocess this list by swapping the first two
rows of any matrix with negative determinant.
28
Finally, the stepwise reconstruction (Subsection 2.2.2) fails for all of these 263 ma-
trices, proving that bdc(per3)7. The algorithm takes 28 seconds on an Intel Core™
i7-4500U CPU (2.4 GHz) to finish.
Unfortunately, bdc(per4)can currently not be determined in this fashion because
the enumeration of all apropriate bipartite graphs, already on 9 +9 vertices, is infea-
sible.
2.3 Uniqueness of Grenet’s construction in the 7×7case
The methods from Section 2.2 can be used to determine all 7 ×7 binary variable
matrices Awith the property that det(A) = per3. By means of a cluster computation
over the course of one week, we determined all 463 binary variable matrices with this
property and made some noteworthy discoveries.
The Grenet construction (see Section 2.5) yields the matrix
x11 x12 x13 0000
100x32 x33 0 0
010x31 0x33 0
0010x31 x32 0
000100x23
000010x22
000001x21
. (4)
It is the unique “sparse” 7×7 binary variable matrix from among the 463, in the sense
that every other matrix from the list has more than three nonzero entries in some row
or column.
Motivated by the above observation, we verified by hand (with computer support)
that in fact, all of the 463 matrices can be reduced to (4) by means of elementary row
and column operations. This can be summarized as follows:
2.3.1 Proposition. Every 7 ×7 binary variable matrix Awith det(A) = per3is equiv-
alent to the Grenet construction (4) under the following two group actions:
(1) The action of{(g,h)|det(g) = det(h)}GL7(Z)×GL7(Z)on 7 ×7 matrices via
left and right multiplication, together with transposition of 7 ×7 matrices.
(2) The action of S3×S3on the variables xij with 1 i,j3, and the corresponding
transposition (i.e., the map xij xji.)
Note that (1) leaves the determinant of any 7 ×7 binary variable matrix invariant and
(2) leaves the permanent polynomial invariant.
29
2.3.2 Example. One of the matrices that occur in our enumeration is the matrix
A:=
x31 x32 x31 0x32 1x23
1x33 0x31 x33 x31 x22
x33 0x33 x32 1x32 x21
101000x22
0x11 x12 x13 000
010010x21
000101x23
.
One can check that indeed det(A) = per3. In this case, the matrices
g:=
0 0 0 0 1 0 0
0 0 1 0 0 1 0
0 1 0 1 0 0 0
1 0 0 0 0 0 1
0 0 0 0 0 0 1
0 0 0 1 0 0 0
0 0 0 0 0 1 0
,h:=
0 1 0 0 1 0 0
1 0 0 0 0 0 0
01 0 0 0 0 0
0 0 10000
1 0 0 0 0 1 0
0 0 1 1 0 0 0
0 0 0 0 0 0 1
are both invertible over Zand gAh is precisely (4).
2.4 Algebraic Complexity Classes
In this section we relate binary determinantal complexity to classical complexity mea-
sures. An algebraic circuit is called skew if for every multiplication gate at least one
of its two parents is an input gate. We denote by ccs(P)the minimum number dsuch
that Pcan be computed by a skew circuit of size d.VPsconsists of all p-families P
such that ccs(Pm)poly(m).
If all input gates of a circuit Care labeled with either 1, 1, or a variable, the circuit
is called constant-free. Note that every constant-free circuit computes a polynomial
that has integer coefficients. We denote by cc0
sand cc0
ws analogously defined complex-
ity measures with the additional condition that only constant-free circuits are allowed.
The complexity classes VP0
sand VP0
ws are defined as consisting of all p-families Psuch
that cc0
s(Pm)and cc0
ws(Pm)are respectively sequences in poly(m). See also [Mal03].
A fundamental result in [Tod92] (see also [MP08]) is that VPws =VPs, so VPsis
another class for which the determinant is complete recall Subsection 1.5.1. Analyz-
ing the constants which appear in the proof of VPws =VPsin [Tod92], we see that the
proof immediately yields VP0
ws =VP0
s. For the sake of comparison with VP0
s, let us
make the following definition.
30
2.4.1 Definition. The complexity class DETP0consists of all sequences of polynomials
that have polynomially bounded binary determinantal complexity bdc.
The main purpose of this section is to show the following statement.
2.4.2 Proposition. VP0
ws =VP0
s=DETP0.
Proof. The proof of [Tod92, Lemma 3.4] immediately shows that DETP0VP0
s. To
show that VP0
sDETP0we adapt the proof of [Tod92, Lemma 3.5 or Theorem 4.3],
but a subtlety arises: The proof shows that from a weakly skew or skew circuit Cwe
can construct a matrix Aof size polynomially bounded in the number of vertices in C
such that det(A)is the polynomial computed by Cwith the drawback that Ais not
a binary variable matrix, but Ahas as entries variables and constants 0, 1, and 1.
Fortunately Proposition 2.1.2 establishes DETP0=VP0
s=VP0
ws.
2.4.3 Remark. In the past, other models of computation with bounded coefficients
have already given way to stronger lower bounds than their corresponding unre-
stricted models: [Mor73] on the Fast Fourier Transform, [Raz03] on matrix multipli-
cation, and [BL04] on arithmetic operations on polynomials.
From Valiant’s completeness result [Val79a] we deduce that VP =VNP implies
perm/VP0
ws. A main goal is to prove perm/VP0
ws unconditionally. This could be a
simpler question than VP =VNP or even VP0=VNP0, because with what is known
today, from permVP0
ws we cannot conclude VP0=VNP0, see [Koi04, Thm. 4.3]. If
we replace the permanent polynomial by the Hamiltonian Cycle polynomial
HCm:=
πSm
πis m-cycle
m
i=1
xi,π(i),
then the question HCm/VPws is indeed equivalent to separating VP0
ws from VNP0,
see [Koi04, Thm. 2.5], mutatis mutandis.
We ran our analysis for HCm,m4 and proved bdc(HC1) = 1, bdc(HC2) = 2,
bdc(HC3) = 3, bdc(HC4)7. The matrices are given at the end of Section 2.5
in (5). This means that 7 bdc(HC4)13, where the upper bound follows from
considerations analogous to Grenet’s construction, see Section 2.5.
2.5 Graph Constructions for Polynomials
In this section, we review the proof of Theorem 2.0.1 from [Gre11]. Furthermore,
we use the same methods to prove the following result about the Hamiltonian Cycle
polynomial:
31
2.5.1 Theorem. We have bdc(HCm+1)m·2m1+1 for all mN.
Recall that we denote by [m]:={1, . . . , m}the set of numbers between 1 and m.
2.5.1 Grenet’s Construction for the Permanent
We prove Theorem 2.0.1. The construction of Grenet is a digraph Γwhose vertices
V:={vI|I[m]}are indexed by the subsets of [m]. Hence, d:=|V|=2m. We
partition V=V0···Vmsuch that Vicontains the vertices belonging to subsets
of size i. We set s:=vand t:=v[m], so V0={s}and Vm={t}. Edges will go
exclusively from Vi1to Vifor i[m]. In fact, we insert an edge from vIto vJif
and only if there is some j[m]with J=I{j}. This edge is then labeled with the
variable xij, where i=|J|. For example, there are medges going from V0to V1, one
for each variable x1jwith 1 jm. It is clear that for each permutation πSm,
there is precisely one s-t-path in Γwhose path weight is (1)m1·x1,π(1)···xm,π(m).
Consequently, the path value of the algebraic branching program Γ= (Γ,s,t)is equal
to (1)m1·perm. Theorem 2.0.1 then follows from the following lemma:
2.5.2 Lemma. Let Γ= (Γ,s,t)be a binary algebraic branching program on d3
vertices with path value ±P. Then, there is a binary variable matrix of size d1
whose determinant is equal to P.
Remark. The proof of this lemma is essentially identical to the proof of Theorem 1.5.4,
but it seems convenient to present it anyway.
Proof. We first construct a graph Gfrom Γby identifying the two vertices sand t
and adding loops with label 1 to every other vertex. The s-t-paths in Γare then in
one-to-one correspondence with the cycle covers of G: Indeed, any cycle cover in G
must cover the vertex s=tand this cycle corresponds to an s-t-path in Γ. Every
other vertex can only be covered by its loop because Γis acyclic. The graph Gnow
has the value ±Pby definition and its directed adjacency matrix Ahas size d1.
Since d12, we can exchange the first two rows of Ato change the sign of its
determinant.
2.5.2 Hamiltonian Cycle Polynomial
In this subsection, we prove Theorem 2.5.1 using Lemma 2.5.2. In order to construct
a binary algebraic branching program Γ= (Γ,s,t)with path value HCm+1, we pro-
ceed similar to Grenet’s construction for the permanent. We will refer to cyclic per-
mutations in Sm+1of order m+1 simply as cycles because no cyclic permutations
of lower order will be considered. Observe that the cycles in Sm+1are in bijection
32
with the permutations in Sm. This can be seen by associating to πSmthe cy-
cle σ= (π(1), . . . , π(m),m+1)Sm+1. In other words, σmaps m+1 to π(1), it
maps π(1)to π(2)and so on.
In addition to two vertices sand t, our binary algebraic branching program will
have a vertex v(I,i)for every nonempty subset I[m]and iI. By our above
Lemma 2.5.2, the resulting binary variable matrix will have a size of
1+
m
i=1(m
i)·i=m·2m1+1.
For m=3, this is equal to 3 ·22+1=13.
We construct the edges in Γso that every cycle σ= (a1, . . . , am,m+1)corresponds
to an s-t-path which has v(I,i)as its k-th vertex if and only if I={a1, . . . , ak}and
i=ak. We insert the following edges:
from sto v({i},i)for each i[m]with label xm+1,i
from v(I,i)to v(I{j},j)for each iI[m]and j[m]\Iwith label xi,j
from v([m],i)to tfor each i[m]with label xi,m+1.
We can again partition the set of vertices as V=V0. . . Vm+1where V0={s},
Vm+1={t}and for k[m], the set Vkconsists of all vertices v(I,i)with|I|=k. Then,
edges go only from Vkto Vk+1, in particular Γis acyclic. Furthermore, all s-t-paths in Γ
have the same lengths and correspond uniquely to cycles in Sm+1. This concludes
the proof of Theorem 2.5.1.
We know of no better construction for arbitrary m, but for small mwe have
HC2=det (x12 0
0x21)HC3=det
0x12 x13
x21 0x23
x31 x32 0
. (5)
33
Chapter 3
Geometric Complexity Theory
Recall Definition 1.3.4 for the notation PRQwhen P,QR[x]are polynomials
over a commutative ring R. We call two polynomials Pand Qequivalent if PRQ
and QRP. We write PRQin this case and PQif there is no ambiguity
concerning the ring R. This notion has been used for p-families already [Bü00, p. 8]
but we require it here for single polynomials only.
It follows directly from the definition that circuit complexity and (binary) deter-
minantal complexity are examples for complexity measures in the following sense:
3.0.1 Definition. Let Rbe a commutative ring. An arithmetic complexity measure
over Ris a function c:R[x]Nsuch that PRQimplies c(P)c(Q).
Remark. For every complexity measure c, equivalent polynomials clearly have the
same complexity with respect to c. Observe also that equivalent polynomials are
identical up to renaming the variables, which follows easily from the definition. It is
straightforward to check that is an equivalence relation.
Notation. If Pand Qare two sets of polynomials, we write P Q to indicate that
(P/)=(Q/), i.e., the equivalence classes of polynomials represented by Pand Q
are the same.
We will work exclusively over the field R=Cof complex numbers from here on
in, mainly because it simplifies the representation theory and algebraic geometry that
enters the picture later.
3.1 Orbit and Orbit Closure as Complexity Measure
We will now introduce a new complexity measure which is the first step towards a ge-
ometric interpretation of Conjecture 1.5.3. Let Wd:=Cd×dbe the space of d×dsquare
matrices. It is a complex vector space and End(Wd)denotes the set of all endomor-
phisms a:WdWd. The determinant is a map detd:WdC, so for aEnd(Wd),
35
we can consider the composition detda, which is again a polynomial function: This
is just linear substitution of variables. We then define the determinantal orbit com-
plexity of Pas
doc(P):=min{dN|aEnd(Wd):Pdetda}, (1)
which is well-defined because doc(P)dc(P)for all polynomials P. Our main goal
will be to show that the two measures are actually equivalent in the following sense:
3.1.1 Definition. Let Xbe a set and c1,c2:XNtwo functions. We say that c1and
c2are polynomially equivalent if there exist t1,t2poly such that c1(x)t2(c2(x))
and c2(x)t1(c1(x)) for all xX. We denote this by c1c2.
3.1.2 Proposition. We have dc doc.
Proof. The inequality doc dc is clear. The following Lemma 3.1.3 states that there is
a function tpoly such that dc(detda)t(d)for every aEnd(Cd×d). Since any
polynomial Pwith d:=doc(P)admits a linear transformation aEnd(Cd×d)with
Pdetda, we have dc(P)dc(detda)t(d) = t(doc(P)).
3.1.3 Lemma. There exists a function tpoly such that for all dNand all linear
maps a:Cd×dCd×d, we have dc(detda)t(d).
Proof. There is some spoly and weakly skew circuits ˜
Cdcomputing the polynomial
detdwith ˜
Cds(d), see [MV97]. Let now a:Cd×dCd×dbe a linear map and
denote by aij :Cd×dCits (i,j)-th component. Each aij is a linear form in n=d2
many variables. Note that a linear form n
i=1aixiwith a1, . . . , anCcan be computed
by a weakly skew circuit of size 2n1:
+
···
+
× × ··· ×
a1x1a2x2··· anxn
(2)
We construct a weakly skew circuit Cdcomputing detdaas follows: Take the disjoint
union of ˜
Cdand d2circuits of the form (2), each of which respectively computes aij for
all 1 i,jd. Then, identify the input gate labeled xij in ˜
Cdwith the gate computing
aij. The result is a weakly skew circuit Cwhich computes detdaand|C|<|˜
Cd|+2d4.
Hence Theorem 1.5.2 implies
dc(detda)|C|+1 | ˜
Cd|+2d4s(d) + 2d4=t(d).
36
3.2 Border Complexity
Let Wd:=Cd×dbe the space of d×dsquare matrices. By Proposition 3.1.2, Conjec-
ture 1.5.3 can be expressed in terms of the values d,mNwhere permis a projection
of some polynomial in detdEnd(Wd).
3.2.1 Remark. Unfortunately, detdEnd(Wd)is unattractive from a geometric point
of view because it is neither a closed nor an open set for the Euclidean or the Zariski
topology in general. We will now replace this set by its closure, which corresponds
to allowing arbitrary approximation in our computational model. This is arguably
natural from both a computational and a geometric point of view, but the implications
for the computational model are not completely understood yet.
For y={y1, . . . , yn}xand dN, the space C[y]d={PC[y]|deg(P)d}is
a finite-dimensional vector space which we endow with the Euclidean topology and
we consider on C[y]the final topology with respect to the inclusions C[y]dC[y].
Lastly, the topology on C[x]that we use is the final topology with respect to the
inclusions C[y]C[x]for all finite subsets yx.
For any complexity measure c:C[x]N, we define the corresponding border
complexity function as
c(P):=min{dNPc1([d])},
where we use the notation [d]:={1, . . . , d}and hence, c1([d]) = {P|c(P)d}. If c
measures complexity, then cmeasures approximate complexity: c(P)dmeans that
any neighbourhood of Pcontains a polynomial of complexity at most d. Clearly,
c(P)c(P)always holds.
This defines the determinantal orbit border complexity doc:C[x]Nand the
determinantal border complexity dc:C[x]N. We note that dc and doc are poly-
nomially equivalent by general principle:
3.2.2 Proposition. For two complexity measures c1and c2,c1c2implies c1c2.
Proof. There is a tpoly such that c1(P)t(c2(P)) for all PC[x]. In other
words, we have c1
2([d]) c1
1([t(d)]). If PC[x]satisfies d:=c2(P), then it follows
from an elementary topological argument that Pc1
2([d]) c1
1([t(d)]), hence
c1(P)t(d) = t(c2(P)). The statement follows by symmetry.
With Proposition 3.1.2 we obtain:
3.2.3 Corollary. doc dc.
The precise statement of the conjecture by Mulmuley and Sohoni is the following:
37
3.2.4 Conjecture ([MS01, 4.3]). doc(perm)is not polynomially bounded in m.
While the converse is not known, Conjecture 3.2.4 implies Conjecture 1.5.3 be-
cause we have doc(perm)doc(perm): If doc(perm)/poly(m), then we also have
doc(perm)/poly(m). For homogeneous polynomials such as the permanent, we can
give a more concrete description of doc. We denote by GL(Wd)End(Wd)the set of
invertible endomorphisms, the general linear group on Wd.
3.2.5 Proposition. For a homogeneous polynomial PC[x]mand any xxwith
x/supp(P), we have
doc(P) = min{dNQdetdGL(Wd):xdmPQ}.
For the proof, we require the following observations:
3.2.6 Lemma. We have detdEnd(Wd) = detdGL(Wd).
Proof. We only have to show the inclusion ”. Since the right hand side is closed, it
is in fact sufficient to show that it contains detdEnd(Wd). Consider the map
ω: End(Wd) C[x]
a detda
It is continuous because the coefficients of detdaare polynomials in the entries of (a
matrix representation of) a. Since GL(Wd)is dense in End(Wd), we have
detdEnd(Wd) = ω(End(Wd)) = ω(GL(Wd)) ω(GL(Wd)) = detdGL(Wd).
3.2.7 Lemma. For two sets P,Q C[x]of polynomials, P Q implies P Q.
Proof. Since is an equivalence relation, we can give C[x]/the quotient topology
and we claim that the quotient map π:C[x]C[x]/closed. Let yxbe a finite
subset. We denote by Sythe group of all set automorphisms of y. For P,QC[y],
we then have PQif and only if there is some πSywith P=Qπ, in the sense
of Definition 1.3.4. Hence, the equivalence class of any PC[y]is finite, from which
it follows easily that the quotient map C[y]C[y]/is closed. Since C[x]/has
the final topology with respect to πand C[x]has the final topology with respect to
the inclusions C[y]C[x]for any finite yx, it follows that πis closed. Therefore,
π(P) = π(Q)implies π(P) = π(P) = π(Q) = π(Q).
Proof of Proposition 3.2.5.Let xxand ˜
x:=x\{x}. Note that every polynomial is
equivalent to a polynomial in C[˜
x]. The map hd:C[˜
x]mC[x]dgiven by P xdmP
is linear, in particular it is continuous.
38
Claim. If doc(P)d, then there is some aEnd(Wd)with hd(P)detda.
We will prove this claim later and show first that it implies the statement. Let docm
be the restriction of doc to C[x]mand observe
doc1
m([d]) ={PC[x]m|doc(P)d}
={PC[x]m|aEnd(Wd):Pdetda}(by the claim)
{PC[˜
x]m|aEnd(Wd):hd(P) = detda}(by Lemma 3.2.7)
=h1
d(detdEnd(Wd)) (by definition)
=h1
d(detdEnd(Wd)) (hdis continuous)
=h1
d(detdGL(Wd)).(by Lemma 3.2.6)
We are left to prove our claim. Assume that doc(P)d, so there is an aEnd(Wd)
with Pdetda. Potentially replacing Pby an equivalent polynomial, we may
assume that supp(P)supp(detd). Let Ssupp(detd)and σ:SSCbe the map
with P= (detda)σ. After composing awith an appropriate linear transformation,
we can assume that σ:SC×, since mapping a variable to another variable or to
zero is a linear operation on the parameter space. Since scaling the variables by a
nonzero complex number is also a linear operation, we achieve that σ:S{1}. If
S=, then P=detdaand in particular m=deg(P) = dso we are done. Otherwise,
we can compose awith a linear projection mapping all variables in Sto one xSand
consequently replace Sby{x}. Since Pand detdaare both homogeneous, it follows
that xdmP=detda.
3.3 The Flip via Obstructions
In this section, we freely use concepts and results from representation theory and
geometric invariant theory, see Chapter A. We also expect the reader to be familiar
with elementary topology.
In light of Proposition 3.2.5 and Conjecture 3.2.4, for md, we define the padded
permanent ppd,m:=xdm
d,d·permwhich is a degree dform on the space W=Cd×d.
3.3.1 Remark. The padding arises naturally from a computational perspective, but it
causes significant problems from a geometric point of view. We will point out these
problems when they occur and conclude the discussion in Chapter 4.
3.3.2 Definition. For PC[W]d, we define (P):=PGL(W)the orbit of P. We
denote by (P)the Euclidean closure of (P). We also write Pand Pinstead
of (P)and (P), depending on which is more readable.
39
Let Wbe a finite-dimensional complex vector space and P,QC[W]dtwo homo-
geneous forms of degree don W. In the context of Conjecture 3.2.4, they play the roles
of Q=ppd,mand P=detdon W=Cd×d, for certain values of d,mN.
By Proposition 3.2.5, we are interested in a way to prove that Q/P. By The-
orem A.1.9.(1), the set Pis locally closed and the following classical result implies
that Pis also Zariski closed, hence an affine variety.
3.3.3 Theorem ([Kra85, AI.7.2]). Let Xbe a complex variety. For a constructible set
UX, the closure of Uin the Euclidean and in the Zariski topology coincide.
It is the goal to show that Q/P. Assuming the converse, we get
QPGL(W) QGL(W)PGL(W) QGL(W)PGL(W).
This would imply that there is a surjection π:C[P]C[Q]of the corresponding
coordinate rings. Via the induced action of GL(W)on these rings (see Remark A.1.5),
πis also a morphism of GL(W)-modules. Choose a basis W
=Cnand by Para-
graph A.2.8, we can write
C[P] =
λΛ+
n
V(λ)ocP(λ).
We call ocP(λ)the orbit closure coefficient of Pat λ. Since πis a GLn-equivariant
morphism, Schur’s Lemma (Lemma A.2.2) implies that ocQ(λ)ocP(λ)for all λ
Λ+
d. Hence:
3.3.4 Proposition (The Flip). Let P,QC[x1, . . . , xn]d. If there exists a λΛ+
nwith
ocQ(λ)>ocP(λ), then Q/P.
This observation is called the “flip” because it allows us to prove the non-existence
of efficient algorithms by proving the existence of certain objects.
3.3.5 Remark. It is quite unlikely that Proposition 3.3.4 is sufficient to separate Q
from Pfor general Pand Q: The GLn-module structure of the coordinate ring of an
affine variety does not define the variety uniquely, see Example 3.3.6 below. It was
kindly communicated to the author by Michel Brion and further related developments
can be found in [Bri11].
3.3.6 Example. Consider G=SL2acting on the space V=C[x,y]2of quadratic forms
in two variables by precomposition. Let C[V] = C[a,b,c]where we write a quadratic
form as a·x2+b·2xy +c·y2. The discriminant of quadratic forms gives a G-invariant
regular function :=b2ac C[V]. For any tC, the polynomial t:=tis
also G-invariant and the fibers Vt:={QV|(Q) = t}=Z(t)Vare closed
40
G-orbits in V, see [Kra85, II.3.3, Beispiel 1, p. 102]. The variety Vtis singular if and
only if t=0, so for example V1is not isomorphic to V0. However, we claim that the
G-module structure of the coordinate ring does not depend on t. Indeed, let IC[V]
be the ideal generated by t. Since tis G-invariant of degree 2, we have C[V]d2
=Id
as G-modules for all dZ, where the isomorphism is given by P P·t. Hence,
the G-module structure of Idoes not depend on the choice of tand in particular, the
resulting module structure of C[Vt] = C[V]/Iis independent of t.
3.4 Orbit and Orbit Closure
For this chapter, let Wbe a C-vector space of finite dimension with some choice of a
basis W
=Cn. Usually, n=d2and Wis a space of d×dmatrices. We write GL(W)
when coordinates are not required and GLnotherwise.
We study the variety P=PGL(W)for some homogeneous form PC[W]d.
This is the natural first step towards understanding its closure. By Theorem A.1.9,
we are interested in the stabilizer group GP:={gGL(W)|Pg=P}of P. The
following two results justify that for the rest of this section, we assume GPto be
reductive and consequently, Pis an affine variety with coordinate ring C[GL(W)]GP.
For a matrix ACd×d, we denote by Atits transpose.
3.4.1 Theorem ([Fro97]). Let W=Cd×dand let detdC[W]dbe the determinant. The
stabilizer group Gdetdis reductive of dimension 2(d21). Moreover,
(1) The identity component of the stabilizer of detdis the group
G
detd={aGL(W)|S,TSLd:BW:a(B) = SBT}.
(2) Denoting by tGL(W)the map t(B):=Bt, the group Gdetdconsists of the two
connected components G
detdand tG
detd.
3.4.2 Theorem ([Bot67;MM62]). Let W=Cm×mwith m3 and let permC[W]m
be the permanent. The stabilizer group Gpermis reductive of dimension 2(m1).
Moreover,
(1) Denoting by mSLmthe subgroup of diagonal matrices where the product of
all entries on the diagonal equals 1,
G
perm={aGL(W)|S,Tm:BW:a(B) = SBT}.
(2) We embed SmGLmas the subgroup of permutation matrices. For σ,τSm,
let cτ
σGL(W)be the map cτ
σ(B) = σBτand let Gσ,τ
perm:=cτ
σG
perm.
Denoting by tGL(W)the map t(B):=Bt, the group Gpermconsists of the 2m!2
connected components tkGσ,τ
permfor k{0,1}and σ,τSm.
41
C[P]is also a G-module and the orbit coefficients ocP(λ)are defined by
C[P] =
λΛ+
n
V(λ)ocP(λ).
Since Pis an open, affine, GLn-invariant subset of P, the restriction morphism
C[P]C[P]is an injective morphism of GLn-modules. By Schur’s Lemma
(Lemma A.2.2), we therefore have ocP(λ)ocP(λ)for all λΛ+
n. While the or-
bit closure coefficients are quite elusive, the orbit coefficients admit a nice description
when GPis reductive:
3.4.3 Proposition. For PC[x1, . . . , xn]with reductive stabilizer GPand λΛ+
n, we
have ocP(λ) = dim(V(λ)GP).
Proof. Recall from Theorems A.1.9 and A.2.7 that
C[P] = C[GLn]GP=
λΛ+
n
V(λ)V(λ)GP.
By Proposition A.2.10 and because λ λdefines an involution on Λ+
n, we get
C[P] =
λΛ+
n
V(λ)(V(λ))GP.
The statement follows by applying Proposition A.2.12 to the representation V(λ)of
the reductive group GP.
The orbit coefficients of the determinant and the padded permanent have been
analyzed in [Bür+11].
3.4.1 Characterization by the Stabilizer
It is crucial to determine whether determinant and padded permanent suffer from the
problem raised in Remark 3.3.5. The following special case of a result due to Larsen
and Pink is interesting in this context. We implicitly use Proposition 3.4.3.
3.4.4 Theorem ([LP90]). Let PC[x1, . . . , xn]dbe such that GPis semisimple and
connected. If ϱ:GPGLnis an irreducible GP-representation, then GPand ϱare
uniquely determined by the orbit coefficients ocP:Λ+
nN, up to an isomorphism of
groups and representations, respectively.
The theorem is interesting because it implies that for certain polynomials orbit
coefficients are sufficient to separate orbits. Mulmuley and Sohoni have shown that
the determinant and the permanent are characterized by their stabilizer [MS01] in the
following sense:
42
3.4.5 Definition. Let Gbe an algebraic group acting linearly on a C-vector space W.
A point PWis said to be characterized by its stabilizer if WGP=C·P.
With this notion, we have:
3.4.6 Corollary. Assume that Pand Qsatisfy the assumptions of Theorem 3.4.4, we
have Q/Pand Pis characterized by its stabilizer. Then, there exists a λΛ+
nwith
ocQ(λ)=ocP(λ).
Proof. We assume that Pand Qhave the same orbit coefficients and deduce QP.
By Theorem 3.4.4,GP=g1·GQ·g=GQgfor some gGLn, this is the definition
of the two representations being isomorphic. Hence, QgC[x1, . . . , xn]GP
d=C·P
because Pis characterized by its stabilizer. In other words, QP.
Unfortunately, the stabilizer groups of determinant and permanent are both dis-
connected, and only G
detdis semisimple, G
perm
=G2(m1)
mis a torus, therefore only
reductive. It has been established in [AYY13, Thm. 1.5] that the semisimplicity hy-
pothesis in the general theorem by Larsen and Pink [LP90] cannot be dropped.
In the literature, the map λ ocP(λ)is referred to as the dimension datum of GP.
One can study the much more general case that Gis an algebraic group and HG
is a closed subgroup. The dimension datum of this inclusion is the map Irr(G)N
given by V dim(VH). Proposition 3.4.3 states that this is equivalent to the map
ocP:Λ+
nNin our situation. The question of how much information about the
inclusion HGis encoded in the dimension datum is an active and recent area of
research, for some advances see [Yu16].
It seems that there is no version of Theorem 3.4.4 which is taylored to the situa-
tion we consider here, yet. We note however that the stabilizers of determinant and
permanent act irreducibly on W=Cd×d.
3.4.7 Proposition. The space W=Cd×dis an irreducible G
detd-representation and in
particular, an irreducible Gdetd-representation.
Proof. Since W
=CdCd, it is an irreducible (SLd×SLd)-representation [FH04, Ex-
ercise 2.36]. This is precisely the action of G
detd.
3.4.8 Proposition. The space W=Cm×mis an irreducible Gperm-representation.
Proof. Note first that W=m
i=1m
j=1C·Eij where Eij is the matrix with entry 1 in
position (i,j)and 0 everywhere else. Using notation from Theorem 3.4.2, the group
m×m
=Gm1
m×Gm1
mis a torus and C·Eij is a weight space for the corresponding
action by simultaneous left and right multiplication. This is the action of G
perm. Let
now UWbe some nonzero, Gperm-stable subspace of W. Then, Uis in particular
43
stable under G
permand must therefore be a direct sum of certain weight spaces. Hence,
there are indices iand jwith Eij U. By Theorem 3.4.2, arbitrary row and column
permutations are in Gperm, so Eij Ufor all 1i,jm. Hence, W=U.
Remark. While this sounds good, closer inspection reveals that the padded perma-
nent fails to retain the desired property. This is related to the padding variable, as
announced in Remark 3.3.1.
Let m<dbe natural numbers and Q:=ppd,mthe padded permanent. Then,
the space W=Cd×dis certainly not an irreducible GQ-representation: The one-
dimensional space of matrices with zeros everywhere except in position (d,d)is in-
variant under GQdue to the description of GQin [Bür+11, 5.6].
Remark. Note that W=Cm×mis not an irreducible G
perm-representation.
Even if we ignore all these problems for now, it remains an open question whether
the orbit closure and its embedding are uniquely defined by the orbit closure coeffi-
cients. This is the important question of whether or not Proposition 3.3.4 is sufficient:
3.4.9 Question. Let P,QC[x1, . . . , xn]dhave reductive stabilizer and Q/P. As-
sume that Pis irreducible, characterized by its stabilizer, and Cnis irreducible as
aGP-module.
(1) Does there exist a λΛ+
nwith ocQ(λ)>ocP(λ)?
(2) The same question, but under the additional assumption that Q=xdm˜
Qis the
product an irreducible polynomial ˜
QC[x1, . . . , xn]mof degree m<dwith a
power of a linear form x, such that ˜
Qis characterized by its stabilizer and Cnis
irreducible as a G˜
Q-module.
(3) The same question, but even more specifically with P=detdand Q=ppd,m.
44
Chapter 4
Occurrence Obstructions
The Flip (Proposition 3.3.4) offers a way to prove noncontainment of orbit closures.
For the polynomials P=detdand Q=ppd,mit is proposed in [MS01;MS08] to exhibit
weights λwhich satisfy ocP(λ) = 0 and ocQ(λ)=0. This implies ocQ(λ)>ocP(λ)
and therefore Q/Pby Proposition 3.3.4. Such weights λare called occurrence
obstructions.
We present in this chapter the results of the previously published [BIH17]. The
paper features a result (Theorem 4.1.2) about the nature of occurrence obstructions
which suggests that they are quite rare. In the subsequent works [IP16;BIP16],
Bürgisser, Ikenmeyer, and Panova strengthened this result considerably, eventually
proving that occurrence obstructions cannot be used to prove Conjecture 3.2.4. Their
method relies only on the fact that we consider a padded polynomial, see also Re-
mark 3.3.1. This is by far the biggest clue to date that the padding variable, while
computationally harmless, causes severe problems for the geometry.
The auxiliary Propositions 4.3.5 and 4.3.8 remain of independent interest: They
provide an unconditional version of a related statement by Kumar, even though the
bounds are not particularly impressive.
In what follows, we will denote by Dd:=(detd)and Pd,m:=(ppd,m)the orbit
closures of determinant and padded permanent. Let also n:=d2. Both Ddand Pd,m
are closed subvarieties of the N-dimensional affine space AN=C[xij |1i,jd]d
of degree dhomogeneous forms on Cd×d, where we define N:=(n+d1
d). The group
GLn
=GL(Cd×d)acts on ANby precomposition.
4.1 Weight Semigroups
We use notation from Section A.3 in this section. Let ZANbe any GLn-invariant
closed subvariety. The group GLnacts on the coordinate ring C[Z]of Zvia precom-
position. We are interested in the set of irreducible GLn-representations occurring
in C[Z]and define
Λ+(Z):={λΛ+
nV(λ)C[Z]}. (1)
45
It is known that Λ+(Z)is a finitely generated submonoid of Λ+
n, cf. [Bri87]. We are
mainly interested in the case where Z=Pis an orbit closure, in this case it is easy
to see that |λ|:=λ1+···+λndZfor any λΛ+(Z). We furthermore denote by
(λ):=min{k|λk=0}the length of λ. If Pdepends on variables only, then it is
known that (λ)for all λΛ+(Z), cf. [Bür+11, §6.3].
An occurrence obstruction is some λΛ+(Pd,m)\Λ+(Dd). If we want to prove at
least dc(perm)>m2+1 by exhibting an occurrence obstruction, then we may assume
that dm2+1. Since any λin Λ+(Pd,m)satisfies|λ|dZand (λ)m2+1d,
we have to look for such partitions λoutside of Λ+(Dd).
Before stating our main result, we need to introduce the concept of saturation,
whose relevance for geometric complexity was already pointed out in [Mul07], see
also [BOR09].
For the following compare [MS05, §7.3]. Let Sbe a submonoid of a free abelian
group FandSthe group generated by S. We call Ssaturated if
λS:kN:kλSλS.
The saturation Sat(S)of Sis defined as the smallest saturated submonoid of Fcon-
taining S. It can also be characterized as the intersection ofSwith the rational cone
generated by S. An element in Sat(S)\Sis called a gap of S. The reason for this
naming becomes apparent from a simple example:
4.1.1 Example. Consider S=N2\{(0,1),(1,0)}, which has N2as its saturation. Re-
placing Sby Sat(S)means filling up the gaps (0,1),(1,0). Generally, understanding
monoids is difficult due to the presence of gaps.
We can now state the main theorem of this chapter:
4.1.2 Theorem. The saturation of Λ+(Dd)contains the set
{λΛ+
n(λ)d,|λ| dZ},
provided that d>2. Hence, occurrence obstructions must be gaps of Λ+(Dd).
4.1.1 Related Work
Our work is closely related to a result by Shrawan Kumar [Kum15]. A latin square of
size dis an d×dmatrix with entries from {1, . . . , d}such that in each row and in each
column each number occurs exactly once. The sign of a latin square is defined as the
product of the signs of all the row and column permutations. Depending on the sign,
we can speak about even and odd latin squares. The Alon-Tarsi conjecture [AT92]
states that the number of even latin squares of size dis different from the number of
46
odd latin squares of size d, provided dis even. The Alon-Tarsi conjecture is known to
be true if d=p±1 where pis a prime [Dri98;Gly10].
4.1.3 Theorem (Kumar). Let dbe even. If the Alon-Tarsi conjecture for d×dlatin
squares holds, then dλΛ+(Dd)for all partitions λsuch that (λ)d.
The above two theorems complement each other. Theorem 4.1.2 is unconditional
and also provides information about the group generated by Λ+(Dd). Theorem 4.1.3
is conditional, but gives a very tight bound on the stretching factor, which is dif the
Alon-Tarsi conjecture holds. The proofs of both theorems focus on the the d-th Chow
variety (see Section 4.3 for details), but otherwise proceed differently. Our proof also
gives information on the stretching factor in terms of certain degrees related to the
normalization the Chow variety, but we were so far unable to bound it in a reasonable
way, see Proposition 4.3.5.
4.2 Saturations of Weight Semigroups of Varieties
We consider the following situation. G:=GLn(C)is the complex general linear group
and Udenotes its subgroup consisting of the upper triangular matrices. Vis a finite
dimensional C-vector space and a rational G-module such that scalar multiples of
the unit matrix InGact on Vby nontrivial homotheties, i.e., there is a nonzero
dZsuch that PtIn=tdPfor tC×and PV. Further, Zdenotes a G-
invariant, irreducible, locally closed nonempty subset of V. Then Zis closed under
multiplication with scalars in C×by our assumption on the G-action.
We consider the induced action of Gon the coordinate ring C[Z]of Zdefined
as in Remark A.1.5. We will denote this induced action from the left and by a dot,
i.e., (g.f)(P) = f(Pg1)for PZ,fC[Z]and gG. As in (1) we define
the monoid Λ+(Z)of representations of the G-variety Z. We shall interpret Λ+(G)
as a subset of Znand denote by Λ(Z):=Λ+(Z)the group generated by Λ+(Z).
Moreover, we denote by coneQ(Λ+(Z)) the rational cone generated by Λ+(Z), that is,
coneQ(Λ+(Z)) :={k1λkN,λΛ+(Z)}Qn.
It is easy to check that the saturation of Λ+(Z)is obtained as
Sat(Λ+(Z)) = Λ(Z)coneQ(Λ+(Z)). (2)
We denote by Frac(R)the field of fractions of an integral ring R. We have an induced
G-action on the field of fractions C(Z):=Frac(C[Z]) and denote by C(Z)Uits subfield
of U-invariants. Recall that a highest weight vector is a U-invariant weight vector. The
following lemma is well known, but we include its proof for completeness.
47
4.2.1 Lemma. We have Frac(C[Z]U) = C(Z)U. Moreover, for a highest weight vector
fC(Z)U, there exist highest weight vectors p,qC[Z]Usuch that f=p/q.
Proof. The inclusion Frac(C[Z]U)C(Z)Uis obvious. Now let fC(Z)Uand
consider the ideal J:={qC[Z]|qf C[Z]}of C[Z]. Since J=0 we have JU=0,
cf. [Hum98, §17.5]. Choose a nonzero qJU. Then p:=qf C[Z]Uand f=p/q,
hence fFrac(C[Z]U).
If fC(Z)Uis a weight vector, we can argue as before, choosing qJUas a
highest weight vector. Then p:=qf is a highest weight vector in C[Z]. The assertion
follows.
4.2.2 Remark. If Y=is a G-invariant open subset of Z, then Λ+(Z)Λ+(Y)and
Λ(Z) = Λ(Y). This follows immediately from Lemma 4.2.1.
Suppose now that Zis a closed subset of V, hence an affine variety. Then we have
an induced G-action on the normalization N(Z)and the canonical map π: N(Z)Z
is G-invariant. Indeed, the integral closure Rof C[Z]in C(Z)is G-invariant and π
corresponds to the inclusion C[Z]R. By construction, we can identify C(N(Z))
with C(Z). Note that Λ+(Z)Λ+(N(Z)) since πis surjective.
4.2.3 Proposition. We have Λ(N(Z)) = Λ(Z)and Sat(Λ+(N(Z))) = Sat(Λ+(Z)).
More precisely, assume that C[N(Z)] is generated as a C[Z]-module by relements.
Then for all λΛ+(N(Z)), there is some k<rsuch that (rk)·λΛ+(Z).
Proof. Let λΛ+(N(Z)) and fC[N(Z)]Ube a highest weight vector of weight λ.
Then fC(Z)Uand Lemma 4.2.1 shows the existence of highest weight vectors
p,qC[Z]U, say with the weights µ,νΛ+(Z), respectively, such that f=p/q.
Therefore λ=µνΛ(Z). This shows the equality for the groups.
Due to (2), it suffices to prove that coneQ(Λ+(Z)) = coneQ(Λ+(N(Z))). Suppose
fC[N(Z)] is a highest weight vector of weight λ. Since fis integral over C[Z],
there are eNand a0, . . . , ae1C[Z]such that such that
fe+
e1
i=0
aifi=0. (3)
We assume that the degree eis the smallest possible.
Note that eis at most the size of a generating set of C[N(Z)] as an C[Z]-module,
as follows from the classical theory of integral extensions, see [AM69, Prop. 5.1 and
the proof of Prop. 2.4].
Consider the weight decomposition ai=µai,µof ai, where ai,µhas the weight µ.
Then ai,µfihas the weight µ+iλ. Moreover, fehas the weight eλ. Since the component
of weight eλin (3) must vanish, we have
fe+
e1
i=0
ai,(ei)λ·fi=0.
48
As the degree eis the smallest possible, the above is the minimal polynomial of f. Ap-
plying any element uUto the above equation and using u.f=f, we get the identity
fe+e1
i=0(u.ai,(ei)λ)fi=0. The uniqueness of the minimal polynomial implies that
u.ai,(ei)λ=ai,(ei)λfor all i. Hence ai,(ei)λis a highest weight vector, provided it is
nonzero. Since there exists i<ewith ai,(ei)λ=0, we see that (ei)λΛ+(C[Z])
for this particular i. We conclude that λconeQ(C[Z]).
4.2.4 Example. If we consider instead the torus G= (C×)dwe can identify Λ+
G
with Zd. Let SZdbe a finitely generated submonoid and consider the finitely gen-
erated subalgebra C[S]:=sS(C·xs1
1···xsd
d)of C[x±1
1, . . . , x±1
d]. If we interpret C[S]
as the coordinate ring of an affine variety Z, then we have a G-action on Zand Λ+(Z)
can be identified with S(Zis called a toric variety). It is known that C[Sat(S)] equals
the integral closure of C[S]in Frac(C[S]), cf. [MS05, Prop. 7.25, p. 140]. Thus the
affine variety corresponding to Sat(S)equals the normalization of Z. This illustrates
Proposition 4.2.3 in the special case of toric varieties.
4.3 Proof of Main Results
Write W:=Cn,G:=GL(W)and consider the symmetric power V:=SymnWwith
is natural G-action. Using the chosen coordinates on W, we obtain an isomorphism
WWthat we denote by a a. We can interpret Vas the space C[W]nof
degree nforms. Note that C[V]k
=SymkSymnWas G-modules. Let x1, . . . , xnW
be a basis and consider the universal monomial
mnn:=x1···xnV.
Clearly, mnnconsists of all symmetric products a1···anof nlinearly independent
linear forms ai. We define the n-th Chow variety
Cn:=mnnEnd(W) ={a1···an|a1, . . . , anW}V.
The name comes from the fact that Cnis a special case of a Chow variety, see [GKZ94].
4.3.1 Lemma. We have Cn=mnn.
Proof. The inclusion Cnmnnfollows since GL(W)is dense in End(W). For the
converse inclusion, let Qmnnbe nonzero. By Theorem 3.3.3, it is the limit of a
sequence (tk·a1k···ank)kN, where tkC×and aik Ware linearly independent
with aik=1. By compactness of the unit sphere in Wwe may assume that after
passing to a subsequence, (aik)kNis convergent for all 1 in. Let bi:=limkaik.
Then, (n
i=1aik)kNconverges to b1···bn=0. It follows easily that (tk)kNconverges
to some tC×and consequently, Q=t·b1···bnCn.
49
When we identify xiwith the variable xii, the Chow variety Cnis contained in Dn
by mapping xij to 0 for i=j, c.f. [Lan15]. The basic strategy, as in Kumar [Kum15], is
to replace Dnby the considerably simpler Cnand to exhibit elements in the monoid of
representations of the latter. More specifically, we have Λ+(Cn)Λ+(Dn)and hence
Sat(Λ+(Cn)) Sat(Λ+(Dn)). Our main Theorem 4.1.2 is an immediate consequence
of the following result.
4.3.2 Theorem. We have Sat(Λ+(Cn)) ={λΛ+
n:|λ|nZ}, provided n>2.
According to Proposition 4.2.3, for proving this we may replace Cnby its normal-
ization. It is crucial that the latter has an explicit description.
For the following arguments compare [Bri93] and [Lan15]. We will revisit them in
Section 6.2. The symmetric group Snoperates on the group
Tn:={(t1, . . . , tn)Cn|t1···tn=1}
by permutation. The corresponding semidirect product Hn:=TnoSnacts on the
space Wn=W×. . . ×Wby scaling and permutation. Note that this action commutes
with the G-action. Consider the product map
ω:Wn=W×. . . ×W Cn
(a1, . . . , an) a
1···a
n,
which is surjective and G-equivariant. Clearly, ωis invariant on H-orbits. Moreover,
the fiber of a nonzero QCnis an Hn-orbit. This easily follows from the uniqueness
of polynomial factorization.
The group Gcontains the subgroup {t·idW|tC×}
=C×and this C×-action
induces a natural grading on the coordinate rings C[Wn]and C[Cn]. Since ωis G-
equivariant, the corresponding comorphism ω:C[Cn]C[Wn]is in particular a
homomorphism of graded C-algebras. However, the C×-action on C[Cn]is not the
canonical one because tC×acts by multiplication with the scalar tn. A homoge-
neous element of degree kn in C[Cn]is the restriction of a k-form on V.
The categorical quotient Wn//Hnis defined as the affine variety that has as its
coordinate ring the ring of Hn-invariants C[Wn]Hn, which is finitely generated and
graded since Hnis reductive, cf. [Kra85]. The inclusion C[Wn]HnC[Wn]defines
aG-equivariant, surjective morphism π:WnWn//Hn. Since Wnis normal, the
quotient Wn//Hnis normal as well, see [Dol03, p. 45] for the easy proof. The map ω
factors through a G-equivariant morphism
φ:Wn//HnCn, (4)
due to the universal property of categorical quotients. Moreover, by construction, the
fibers of φover a nonzero QCnconsist of just one element. The action of Hn
50
on C[Wn]is linear, therefore it respects the grading. It follows that the comorphism
φ:C[Cn]C[Wn]Hnis again a homomorphism of graded C-algebras.
The following is shown in [Bri93, Prop., p. 351] and we give a proof in slightly
different language in Section 6.3.
4.3.3 Lemma. The morphism φ:Wn//HnCnis the normalization of Cn.
Furthermore,
4.3.4 Lemma. We have
Λ+(Wn//Hn) ={λΛ+
nk:|λ|=kn and V
G(λ)SymnSymkCn}.
Proof. We shall decompose the coordinate ring C[Wn//Hn]with respect to the G-
action. We have C[W] = kNSymkWand therefore
C[Wn] = C[W]n=
(k1,...,kn)Nn
Symk1W···SymknW.
Taking Tn-invariants yields
C[Wn]Tn=
kN
SymkW. . . SymkW.
Taking Sn-invariants gives
C[Wn//Hn]
=C[Wn]TnoSn=
kN
SymnSymkW, (5)
and the assertion follows.
Recalling Theorem 4.1.3, we expect the stretching factor to be n. Without relying
on the Alon-Tarsi conjecture however, the following exponential bound is the best we
can currently provide.
4.3.5 Proposition. Λ+(Cn)generates the rational cone {qQn|q1 ··· qn0}.
More precisely: Assume n>2. For each partition λwith (λ)nand |λ| nZ,
there is some number k<nn22nsuch that we have 2k·λΛ+(Cn).
Proof. For the first statement, it is sufficient to show that coneQ(Λ+(Cn)) contains
any partition λwith |λ|=nk and (λ)n. According to Proposition 4.2.3, the
semigroups Λ+(Cn)and Λ+(Wn//Hn)generate the same rational cone. So we need
to show that λlies in coneQ(Λ+(Wn//Hn)).
In [BCI11] (see [Ike12] for a simpler proof) it was shown that V
G(2λ)occurs in
SymnSym2k(Cn). Thus Lemma 4.3.4 with Proposition 4.2.3 imply that λlies in the
rational cone generated by Λ+(Wn//Hn).
51
We will now make the above reference to Proposition 4.2.3 precise. Recall that the
comorphism of φfrom (4) is an integral extension φ:C[Cn]C[Wn]Hnof graded C-
algebras. Note that the grading induced by Gon these two rings is such that only
degrees which are multiples of ncontain nonzero elements. We therefore change
the grading such that degree n·kbecomes degree k. This means that the direct
sum (5) is the grading of C[Wn]Hnand the grading of C[Cn]is the grading induced
by the canonical grading induced by the polynomial ring C[V]. Therefore, C[Cn]is
generated by elements of degree one.
By [DK02, Lemma 2.4.7], there is a system of parameters y0, . . . , yrC[Cn]1from
among sufficiently generic linear forms. This means that R:=C[y0, . . . , yr]is a poly-
nomial ring in the yiand C[Cn]is a finite R-module. It follows that C[Wn]Hnis integral
over Rand y0, . . . , yris also a homogeneous system of parameters for C[Wn]Hn.
We note that r+1=dim(Wn//Hn) = dim(Wn)dim(Hn) = n2n+1, so
r=n2n. Furthermore, C[Wn]Hnis Cohen-Macaulay [DK02, Thm. 2.5.5] and by
[DK02, Prop. 2.5.3], it follows that C[Wn]Hnis free as an R-module, i.e., C[Wn]Hn
=RD
for some DN. Since Dis the number of generators of C[Wn]Hnas an R-module,
Dis a (possibly rough) upper bound for the number of generators of C[Wn]Hnas an
C[Cn]-module. The second assertion follows from Proposition 4.2.3 as soon as we
have verified that D<nn22n.
The Hilbert polynomial of RDis a polynomial of degree rwhose leading coefficient
is equal to D
r!. Since (5) gives the grading of C[Wn]Hn, the Hilbert polynomial of
C[Wn]Hnis given, for sufficiently large k, by the map
k ((k+n1
n1)+n1
n),
whose leading coefficient is 1
n!(n1)!n. Since r=n2n, we have D=(n2n)!
n!(n1)!n. We
apply Stirling’s approximation:
n>0: 1 n!
2π·en·nn+1
2e1
12n
to the fraction Dand obtain
D=(n2n)!
n!(n1)!n2π·e1
12n·(n2n)n2n+1
2·enn2
2π·nn+1
2·en·2πn·(n1)(n1
2)n·e(n1)n
=(e
2π)n
·e1
12n·nn2n+1
2·(n1)n2n+1
2
nn+1
2·(n1)n2n
2
=(( e
2π)n
·e1
12n·(n1)1n
2)
R(n)
·nn22n.
52
It is easy to see that R(n)is monotonically decreasing for n2 and takes a value
smaller than 1 for n=3. Hence, for n>2 we have D<nn22n.
We shall now determine Λ(Cn). Since Wn//Hnis the normalization of Cn, Propo-
sition 4.2.3 tells us that Λ(Cn) = Λ+(Wn//Hn). The latter is described in terms of
plethysms in Lemma 4.3.4.
Recall W=Cnand G=GL(W). We say that λoccurs in SymnSymkWif V
G(λ)
occurs as a submodule in the latter. We will also make use of the convenient notation
k×d:= (d, . . . , d, 0, . . . , 0)Znfor a rectangular partition with krows of length d.
4.3.6 Lemma. Let N. If λoccurs in SymnSymkW, then (1×k) + λoccurs in
Sym+nSymkW.
Proof. Let WU=Cw, i.e., wis a highest weight vector of Wwith weight 1 ×1. Then
wkSymSymkWis a highest weight vector of weight (1×k).
Let fSymnSymkWbe a highest weight vector of weight λ. Then the product
wkfSym+nSymkWis a highest weight vector of weight (1×k) + λ.
4.3.7 Lemma. Let nk2, d:=k(k1)/2, and the partition µof size k2be obtained
by appending to 2 ×da column of length k. Further, let λdenote the partition of size
nk obtained by appending to µa row of length (nk)k. Then the partition λoccurs
in SymnSymkW.
Proof. The GL2-module ΛkSymk1C2is one-dimensional, since Symk1C2is of di-
mension k. Hence it contains a nonzero SL2-invariant. In other words, 2 ×doccurs
in ΛkSymk1C2. The “inheritance principle” states that 2 ×doccurs in ΛkSymk1Cn
(compare for instance [Ike12, Lemma 4.3.2]).
Cor. 6.4 in [MM15] implies that µoccurs in SymkSymkW. Finally, Lemma 4.3.6
implies the assertion.
4.3.8 Proposition. Λ+(Cn)generates the group{λZn:|λ|nZ}if n>2.
Proof. Using the software [BKT12], we checked that the partition (2,2,0, . . . , 0)occurs
in Sym2Sym2Wand (6, 3,0, . . . , 0)occurs in Sym3Sym3W. Using Lemma 4.3.6, we
can conclude that (2n2, 2, 0, . . . , 0)occurs in SymnSym2W, and (3n3, 3,0, . . . , 0)
occurs in SymnSym3Wif n3. (We note that (3, 3,0, . . . , 0)does not occur in
Sym2Sym3W; this is the reason for the assumption n>2.) From Lemma 4.3.4 we
conclude that
λ(2):= (n1, 1,0, . . . , 0) = (3n3, 3,0, . . . , 0)(2n2, 2,0, . . . , 0)
lies in the group Λgenerated by Λ+(Wn//Hn). Clearly, λ(1):= (n, 0, . . . , 0)Λ.
53
For 3 knlet λ(k)Λdenote the partition from Lemma 4.3.7. Then we have
(λ(k)) = kand λ(k)
k=1. This easily implies that λ(1), . . . , λ(n)generate the group
˜
Λ:={λZn:|λ|nZ}. Since Λ˜
Λis obvious, we conclude that Λ=˜
Λ.
Proof of Theorem 4.3.2.Use (2) with Propositions 4.3.5 and 4.3.8.
4.3.9 Remark. The assumption n>2 in Theorem 4.3.2 is necessary. Indeed, we have
C2=Sym2C2, and one can show that Λ+(Sym2C2)generates the group (2Z)2, com-
pare [FH04, §11.2].
54
Part II
Orbit Closures of Homogeneous Forms
55
Chapter 5
Preliminaries
Throughout Part II, we will consider the following situation. We work over the field C
of complex numbers and W
=Cnis a C-vector space of dimension n, for which
we sometimes assume some choice of coordinates C[W] = C[x1, . . . , xn], to iden-
tify GL(W)with GLn=GLn(C). The group GL(W)acts on the space C[W]d=
SymdWof homogeneous forms of degree d>0 on Wby precomposition from the
right:
C[W]d×GL(W) C[W]d
(P,g) Pg
We view V:=C[W]d
=ANas an affine space of dimension N=(n+d+1
d). Note
that C[V]is simply a polynomial ring in the coefficients of all homogeneous degree d
polynomials in nvariables. We will study subvarieties of this space in the language
of algebraic geometry, therefore we usually assume the Zariski topology on V.
This scenario is motivated by the GCT approach that we outlined in Chapter 3. In
that context, one is interested only in special cases, primarily W=Cd×dand the point
P=detd. We put this into a slightly more general context here, one reason being
that we have very little understanding of the determinant orbit closure, even for small
values of d. For what little we know, see Chapter 8. To gain a better understanding of
the phenomena that occur, we believe that easier examples should be studied first.
Another reason is also that the generality we present here offers an interesting and
little explored mathematical problem: Classical geometric invariant theory offers the
tools to study the entire orbit structure of V, but a classification seems out of reach
for n>3 and d>4. We on the other hand pick specific forms PVand study only
the orbit structure of P, or even just the orbits that are of maximal dimension in a
component of P. In this chapter, we make some general observations and elaborate
on methods that we will apply in the chapters to come.
Unless explicitly stated otherwise, we only consider points PVwhose stabilizer
GP={gGL(W)|Pg=P}is reductive, mostly because it simplifies the theory
57
considerably and the condition is satisfied in our cases of primary interest, recall
Theorems 3.4.1 and 3.4.2.
We write P:=PGL(W)Vfor the orbit of Pand denote by PVits closure
as in Definition 3.3.2. Recall also that by Theorem 3.3.3,Pis both the Euclidean and
the Zariski closure of P. An element of Pis called a degeneration of P.
5.1 Conciseness
Let us call a form PV=C[W]dconcise if it is not stabilized by any noninvertible
endomorphism, i.e.
aEnd(W):(Pa=P)(aGL(W)).
We will give other characterizations of this property here, for example it means that
there is no choice of coordinates C[W] = C[x1, . . . , xn]such that PC[x1, . . . , xn1].
Informally, this may be expressed as Puses all variables”. We will also see that being
concise is an open condition in Proposition 5.1.2.
5.1.1 Definition. Let xW=C[W]1and wW. We define the partial derivative of
xdC[W]din direction wto be the form wxd:=d·x(w)·xd1, c.f. [Lan12, eq. (2.6.6)].
By linear extension, this defines wPfor any PC[W]dbecause C[W]d=SymdW
is spanned by powers of linear forms as a vector space [Mar73, Theorem 1.7]. If we
have chosen coordinates C[W] = C[x1, . . . , xn], we set iP:=eiPfor 1 in, where
eiWis the dual basis vector of xiW.
5.1.2 Proposition. Let Wbe a vector space of finite dimension. A form PC[W]dis
concise if and only if the linear map WC[W],w wPis injective.
Proof. The case d=0 is trivial, a constant form Pis stabilized by any endomorphism,
hence it is concise if and only if GL(W) = End(W), which is equivalent to W={0}.
The case d1 follows from Lemma 5.1.6 below for the identity map a=idW.
5.1.3 Remark. In particular, not being concise is a polynomial condition on the coeffi-
cients of PC[W]d: It is given by the vanishing of all maximal minors of the linear
map WC[W]d1,w wP. Hence, the set of all concise forms is a Zariski open
subset of C[W]d.
5.1.4 Corollary. Let PC[x1, . . . , xn]d. Then, Pis concise if and only if the partial
derivatives 1P, . . . , nPare linearly independent.
58
5.1.5 Remark. After choosing coordinates, a polynomial PC[x1, . . . , xn]satisfies
xisupp(P)if and only if iP=0. Hence, Pis not concise if and only if there is
a choice of coordinates such that one of the variables does not appear in the support
of Pat all.
For aEnd(W), we denote by rk(a):=dim(a(W)) its rank.
5.1.6 Lemma. Let PC[W]dwith d1. For aEnd(W), we consider the linear
map δa:WC[W],w w(Pa). The following conditions are equivalent:
(1) rk(a)rk(δa).
(2) For any bEnd(W)with Pa=Pb, we have rk(a)rk(b).
Proof. We choose coordinates C[W] = C[x1, . . . , xn]and set Q:=Pa. Denote by
(Q):= (1Q, . . . , nQ)C[x1, . . . , xn]1×n
d1
the (row) vector of partial derivatives of Q. Let v1, . . . , vkCnbe a basis of the vector
space {vCn|(Q)·v=0}, so r:=nkis the dimension of the C-vector space
spanned by the iQ, i.e., r=rk(δa).
Assume that (2) holds. We have to show that rk(a)r. Let gGL(W)be an
invertible linear transformation that maps the i-th standard basis vector eiCnto vi
for 1 ik. Denoting by a dot the product of matrices, the chain rule yields
i(Qg) = (Qg)·ei= ((Q)·vi)g=0 for 1 ik.
This implies that QgC[xk+1, . . . , xn]. We consider the linear map cEnd(W)
which maps xic=0 for 1 ikand xjc=xjfor j>k. Then, Qgc=Qg,
therefore Pagcg1=Q=Pa. It follows that r=rk(c)rk(agcg1)rk(a)by
our assumption.
Conversely, assume that (1) holds, so we have rk(a)r. Let bEnd(W)be such
that Pa=Pb. For any vker(b), the chain rule yields
(Pa)·v=(Pb)·v= ((P)b)·b·v=0,
so any vker(b)is a linear relation among the partial derivatives of Pa. In other
words, k=dim(ker(δa)) dim(ker(b)). Therefore,
rk(a)r=nkndim(ker(b)) = rk(b).
59
5.2 Grading of Coordinate Rings and Projectivization
The next observation is that Pand Pare both affine cones in the following sense:
5.2.1 Definition. A subset CVis called an affine cone if PC:C·PC.
Note that GL(W)contains a central copy of Gm(C) = C×in the form of scalar
maps C×·idW, where idWdenotes the identity map. Given QPC[W]dand
any tC, let ζCbe some d-th root of t. Then, tQ =ζdQ=QζidWQ=P.
Since scalar multiplication is continuous, we have the following general principle.
5.2.2 Lemma. If CVis an affine cone, then Cis also an affine cone.
Proof. The reductive group Gm= (C×,·)acts algebraically on Vby scalar multiplica-
tion. Denote by α:Gm×VVthe action morphism. Since Gm×C=Gm×C, we
have α(Gm×C)α(Gm×C) = C. Hence, Cis Gm-invariant.
This means that Pis an affine cone. In particular, the coordinate ring C[P]
inherits a N-grading from C[V]. On the other hand, C[GLn]has a natural Z-grading
that is the result of localizing at the SLn-invariant polynomial detn. The coordinate
ring C[P] = C[GLn]GPinherits this grading. By Proposition A.2.9,
C[P]d=
λΛ+
n
|λ|=d
V(λ)V(λ)GP.
We will now see that under reasonable assumptions, C[P]is also the localization
of C[P]at some SL(W)-invariant and the grading of C[P]is the same as the natural
one induced by this localization. We call a form PC[W]dpolystable if PSL(W)
is Zariski closed in C[W]d. Both the permanent and the determinant are polystable
[BI15, 2.9]. Furthermore, Bürgisser and Ikenmeyer proved:
5.2.3 Theorem ([BI15, 3.9]). If PV=C[W]dis polystable, then there exists a homo-
geneous invariant fC[P]SL(W)such that C[P] = C[P][ f1]. In particular,
P={QPf(Q)=0}.
Furthermore, if fis homogeneous of minimal degree with this property, then fis
unique up to scalar.
Remark. Bürgisser and Ikenmeyer also study the minimal degree and other numerical
quantities related to these invariants, for the determinant and several other interesting
families of polynomials.
60
Studying the geometry of Pis equivalent to studying its projectivization P(P),
which is a projective variety. Furthermore, we know that P(P)is a smooth, open
subset of P(P)which we understand well. While this is quite promising, it turns
out [BI15, 3.10,3.17,3.29] that Pis not a normal variety in most cases and in partic-
ular, when Pis detnor pernwith n3. It follows that in all these cases, Phas
singularities. Since Pis smooth, these singularities will be part of the boundary
P:=P\P, which is therefore a subvariety that needs to be analyzed further.
We note first that it is a hypersurface in our cases of interest:
5.2.4 Lemma. If GPis reductive, then Pis pure of codimension one in P.
Proof. The complement of an affine variety is always pure of codimension one [Gro67,
Corollaire 21.12.7]. The statement follows because for reductive GP, the orbit Pis
affine by Theorem A.1.9.
5.3 Rational Orbit Map
The orbit map ωP: End(W)Pgiven by a Pais line-preserving, therefore it
can be viewed as a rational map
ϖP:PEnd(W) PP(1)
[a] [Pa]
Since it can happen that Pa=0 for a=0, the map ϖPis not defined everywhere.
If it was defined everywhere, it would be a dominant, projective morphism, therefore
surjective [Sha94, I.5.2]. We will transform ϖPinto a projective morphism by two
steps, the first of which is explained in this section. In Chapter 6we study the special
case where we obtain a morphism after the first step already. The second step is
postponed until Section 7.3 and we will see how to combine both steps in the final
two chapters.
5.3.1 Definition. For a form PC[W]d, we define
AP:={aEnd(W)|Pa=0}(2)
and call it the annihilator of P. It is an affine cone in End(W)whose projectivization
is precisely the set of points where ϖP:PEnd(W)99K PPis not defined.
5.3.2 Remark. Another way to see the annihilator is as
AP={aEnd(W)|a(W)Z(P)},
61
the set of all endomorphisms whose image is a linear subspace of the hypersurface
Z(P)W. We will call a linear subspace LZ(P)maximal if there is no other linear
subspace of Z(P)which properly contains L. With this notion, APis the set of all
aEnd(W)whose image is contained in some maximal linear subspace of Z(P).
The stabilizer GP={gGL(W)|Pg=P}acts by left multiplication on End(W)
and ωPis GP-invariant with respect to this action, since ωP(ga) = Pga=Pa
for all gGP. This action induces an action on PEnd(W)and consequently, the
rational map ϖPfrom (1) is invariant with respect to this induced action. We stress
again that GPis assumed reductive unless explicitly stated otherwise.
Consider now a map aEnd(W)with 0 GPa, i.e., there is a sequence (gn)nN
with gnGPsuch that limngna=0. Such points aEnd(W)are called
unstable with respect to the action of GPin the sense of geometric invariant theory
[MFK94], see also [New78;Kra85;Dol03].
5.3.3 Definition. Let Pbe a homogeneous form. The set of unstable endomorphisms
with respect to the action of GPis called the nullcone of Pand we denote it by
NP:={aEnd(W)0GPa}. (3)
See Subsection A.1.2 for other, equivalent definitions of the nullcone.
A central observation is the following:
5.3.4 Proposition. We have NP AP. In other words, if aEnd(W)is unstable with
respect to the left action of GP, then Pa=0. This statement holds regardless of
whether or not GPis reductive.
Proof. Let a NP, meaning 0 GPa. Since ω1
P(Pa)is a closed subset of End(W)
containing GPa, it also contains its closure. Thus, 0 GPaω1
P(Pa), which
means Pa=P0=0.
We will now explain that the points in NPinduce “harmless” indeterminacies
of ϖP. We require the results from Subsection A.1.2, in particular the notion of
semistability. In our situation,
End(W)ss :=End(W)\NP={aEnd(W)0 /GPa}
is the set of semistable endomorphisms and PEnd(W)ss PEnd(W)is an open
subvariety of PEnd(W)which admits a categorical quotient, see Proposition A.1.11.
An important property of this quotient is the fact that
PEnd(W)ss//GP=Proj(C[End(W)]GP)
62
is a projective variety, even though PEnd(W)ss is only quasi-projective in general.
The first step for transforming ϖPinto a morphism is the following proposition,
which follows immediately from Propositions A.1.12 and 5.3.4.
5.3.5 Proposition. The domain of definition of ϖP:PEnd(W)99K PPis contained
in the open subset PEnd(W)ss. Furthermore, there is a commutative diagram
PEnd(W)ss
π
ϖPPP
PEnd(W)ss//GP
φP
and PEnd(W)ss//GPis a projective variety.
Remark. We can think of the rational map φPas having less indeterminacy than ϖP
because NPhas been removed before passing to the quotient.
For PC[W]d, the stabilizer GPGL(W)acts on the vanishing set
Z(P):={wW|P(w) = 0}W,
because P(w) = 0 implies P(g(w)) = (Pg)(w) = P(z) = 0 for any gGP. In par-
ticular, GPacts on the set of linear subspaces of Z(P). We will often study this action,
in particular to determine the structure of GPin some cases. In light of Remark 5.3.2,
we therefore introduce the following terminology:
5.3.6 Definition. For aEnd(W), we denote by im(a):=a(W)its image. A linear
subspace LZ(P)is called semistable if there is some aEnd(W)ss with L=im(a).
Otherwise, Lis called unstable.
5.3.7 Remark.
(1) For any a NP, the space L:=im(a)is unstable: If some bEnd(W)satisfies
L=im(b), then there is a gGL(W)with b=agand since gis an automor-
phism of End(W), we have 0 GPag=GPag=GPb.
(2) Subspaces of unstable linear subspaces are unstable. Indeed, let a NPand
Lim(a)a subspace. Then, L=im(ap), where pEnd(W)is some projection.
Since 0 GPapGPap, the space Lis also unstable.
63
Chapter 6
Closed Forms
If PC[W]dsatisfies P=PEnd(W), we call it closed. This is equivalent to the
fact that ωPhas closed image.
6.0.1 Remark. If Pis closed, then Pis always irreducible: It is the image under
ωP: End(W)Pof the irreducible hypersurface S End(W)of noninvertible
endomorphisms.
6.0.2 Example. Every quadratic form is closed. In fact, if there is some symmetric
matrix bCn×nsuch that P(x) = xtbx, then for any endomorphism a, the form Pa
is given by the matrix atba. We may assume that Pis concise, i.e. bis invertible.
It follows that{atba aCn×n}={ataaCn×n}is the space of symmetric ma-
trices, hence PEnd(Cn) = C[x1, . . . , xn]2. In particular, this is equal to Pand Pis
closed.
6.1 A Sufficient Criterion
We know only very few examples of closed forms of higher degree. A classification
remains an interesting open problem, we can only give some partial results and pose
several questions. Recall the definitions of APand NPfrom Definitions 5.3.1 and 5.3.3.
A sufficient criterion for being closed is the following:
6.1.1 Proposition. Let PC[W]dbe a form with reductive stabilizer GPGL(W). If
we have AP=NP, then Pis closed.
Proof. Recall the definition of ϖPfrom (1) and Proposition 5.3.5. The assumption
means that ϖPis defined everywhere on PEnd(W)ss and so the induced rational
map φP:PEnd(W)ss//GPPPis in fact a morphism. Since it is dominant and
projective, its image is equal to PP. The statement follows because ϖPhas the same
image as φP.
6.1.2 Corollary. If the hypersurface Z(P)has no semistable linear subspaces, then P
is closed.
65
In Proposition 5.3.4, we saw NP AP. Proposition 6.1.1 states that Pis closed if
this inclusion is an equality. Having no counterexample but also no wide selection of
examples in general, we ask the following question:
6.1.3 Question. Does NP=APhold for every closed form P?
We know of no counterexample for Question 6.1.3, but for two examples of closed
forms that are already well-known, we illustrate that NP=AP.
6.1.4 Example. We know from Example 6.0.2 that concise quadratic forms are closed.
We claim that NP=APfor any such PV=C[x1, . . . , xn]2. We may assume
without loss of generality that P=x2
1+···+x2
nbecause any concise quadratic form
is an element of P. On the one hand, we observe that
AP={aCn×nPa=0}={aCn×n|ata=0}.
The stabilizer of Pis the orthogonal group On={aGLnat=a1}. As before,
we consider the operation of Onon Cn×nby left multiplication. By the First Fun-
damental Theorem for the Orthogonal Group [Pro06, 11.2, Theorem on p. 390], the
On-invariant functions on Cn×nare generated by the entries of the map a ata, and
by Lemma A.1.10 we therefore also have NP={aCn×nata=0}.
6.1.5 Example. Consider P:=z(y2+xz)C[x,y,z]3. The classification of ternary
cubic forms [KM02] implies that Pis closed. We will show that AP=NP. By
Example 2 in said reference, the stabilizer GPGL3is generated by the matrices
uα:=
12αα2
0 1 α
0 0 1
and tε:=
ε40 0
0ε0
0 0 ε2
.
Note that U:={uα|αC}
=(C,+) is a normal unipotent subgroup of GPwhile
the group T:={tε|εC×}
=(C×,·)is reductive. More precisely, tεuαtε1=uε3α.
We have GP=TnU, the unipotent radical of GPis equal to Uand in particular GP
is not reductive and we cannot appeal to Proposition 6.1.1 to show that it is closed.
However, the definitions of NPand APdo not require Pto have reductive stabilizer.
Observe that Z(P) = Z(z)Z(y2+xz). The maximal linear subspaces of Z(P)are
therefore the plane Z(z)and any line in Z(y2+xz). We first note that Z(z)is unstable
because for any wZ(z), we have tε(w)0 as ε0. Therefore, we are left to show
that any wZ(y2+xz)\Z(z)spans an unstable line. After applying an appropriate
scaling matrix tε, we can assume that w= (y2,y,1). Consider
gε:=tε1uε3y=1
ε4·
12(ε3y)(ε3y)2
0ε3ε3(ε3y)
0 0 ε6
.
A straightforward computation shows that gε(w) = (ε2,ε2,ε2)0 as ε0.
66
6.2 Normalizations of Orbit Closures
For almost every family of homogeneous forms studied in the context of GCT, the or-
bit closures are not normal varieties [Kum15;BI11]. In Section 4.3, we saw a prominent
example of an orbit closure whose normalization has a nice representation-theoretic
description: For this section, we choose coordinates C[W] = C[x1, . . . , xn]and con-
sider the n-form mnn:=x1···xn, the universal monomial. It is also the top symmet-
ric form. The variety Cn=(mnn)is called the n-th Chow variety. In Chapter 4we
also used the notation Dn=(detn), and we recall that CnDnbecause mnnis the
restriction of detnto diagonal matrices.
By Lemma 4.3.1, the universal monomial is closed. Again, we observe that it
actually satisfies the condition we discussed in the previous section:
6.2.1 Proposition. We have Nmnn=Amnnfor all n2.
Proof. We fix n2 and let P:=mnn. We show that any aEnd(W)with Pa=0
satisfies 0 GPa. The group GPcontains all diagonal matrices t=diag(t1, . . . , tn)
with 1 =t1···tn. Assume that aCn×n
=End(W)satisfies Pa=0 and let
ai:=xia=n
j=1aijxj, then we have 0 =Pa=a1···an. Since C[W]is an integral
domain, it follows that there must be some iwith ai=0. Without loss of generality, we
may assume that a1=0, i.e., the first row of acontains only zeros. The diagonal matrix
tε:=diag(ε1n,ε, . . . , ε)stabilizes Pand tεa=ε·a. We can view tεas a regular map
t:CEnd(W)mapping ε tε. Since t(C×)GP, we have t(C×)aGPa.
Passing to the closure, we get 0 =t0aGPa.
The following theorem is well-known. It is implied by the proof of Lemma 4.3.3,
but we will also obtain it as a corollary of the upcoming Theorem 6.2.3 we only
quote it here to provide context. Refer to Section A.3 for the notion of the polynomial
part V0of a GL(W)-module V.
6.2.2 Theorem. Let ν: N(Cn)Cnbe the normalization of the Chow variety. The
morphism νcorresponds to the inclusion C[N(Cn)] = C[Cn]0C[Cn].
Arguably, Theorem 6.2.2 and Remark 6.0.1 motivated Landsberg [Lan15] to ask
the following question: Is it true that whenever a GL(W)-orbit closure with reductive
stabilizer has an irreducible boundary, the coordinate ring of the normalization of the
orbit closure equals the polynomial part of the coordinate ring of the orbit?
The answer is negative, as we recently showed [Hü17]. We will instead prove that
C[N(P)] = C[P]0is equivalent to AP=NP. Furthermore, we illustrate that
these equivalent conditions can be violated in cases where Pis irreducible. The
main theorem of this chapter is the following, its proof is postponed to Section 6.3.
67
6.2.3 Theorem. Let Vbe a polynomial right-GL(W)-module where we denote the
action by V×GL(W)V,(P,g) Pg. Let PVbe a point with reductive
stabilizer GGL(W). Denote by =PGL(W)its orbit and by ν: N()the
normalization of its orbit closure.
There is a canonical, injective homomorphism ι:C[N()] C[]0of graded
C-algebras and GL(W)-modules, and the following statements are equivalent:
(1) The injection ιis an isomorphism.
(2) For all aEnd(W)with Pa=0, we have 0 Ga.
(This means AP=NPin the case of homogeneous forms)
If either condition is satisfied, we have PEnd(W) = .
Note that Theorem 6.2.3 implies Theorem 6.2.2 by Proposition 6.2.1. Furthermore,
we can use Theorem 6.2.3 to quickly deduce that the orbit of an elliptic curve is a
counterexample for the original question by Landsberg:
6.2.4 Proposition. Let W=C3and let PC[W]3be any form that defines a non-
singular curve in P2. Let ν: N(P)Pbe the normalization of P. Then, the
stabilizer of Pis reductive, Pis irreducible and C[N(P)] is not isomorphic to
C[P]0as GL(W)-modules.
For the proof, we require some classical results about ternary cubics to deduce that
the stabilizer of Pis reductive and the boundary of its orbit irreducible:
6.2.5 Lemma. If PC[W]3defines a smooth curve, it has a finite (hence reductive)
stabilizer and P=Qis the orbit closure of a concise form Q. In particular, P
is irreducible.
Proof. By [KM02, Corollary 1], the stabilizer of Pis finite. Any finite group is re-
ductive by Maschke’s Theorem. A complete diagram of the degeneracy behaviour of
all ternary cubic forms can be found in Section 4 of [KM02]. Choosing coordinates
C[W] = C[x,y,z], the diagram implies that Pis equal to the orbit closure of the
polynomial Q:=x3y2z, regardless of the choice of P. One can compute that Qis
concise by means of Corollary 5.1.4. Hence, P=Q. This variety is irreducible be-
cause it is the closure of the image of the irreducible variety GL(W)under the regular
map GL(W)C[W]3,g Qg.
Proof of Proposition 6.2.4.By Lemma 6.2.5, the polynomial Phas a reductive stabilizer
and its orbit has an irreducible boundary. Let [w]P2be any point on the curve
defined by P, i.e., wWis nonzero and P(w) = 0. Let aEnd(W)be of rank
one such that im(a)is spanned by w. Then, Pa=0 and since GPis a finite
group, GPa=GPadoes not contain the zero map. Hence by Theorem 6.2.3,
the two GL(W)-modules C[N(P)] and C[P]0are not isomorphic.
68
This counterexample also yields another important observation:
6.2.6 Corollary. Let S:=End(W)\GL(W)be the hypersurface of noninvertible endo-
morphisms. Then, ωP(S)Pis not an irreducible component of Pin general.
Proof. With Pas in Lemma 6.2.5, we know that Zariski almost every element of P
is concise, so the image of ωP(S)cannot be dense in P.
6.2.1 The Aronhold Hypersurface
As an example for Proposition 6.2.4, we consider the special case of the Fermat cu-
bic: Let W=C3,V=C[W]3=C[x,y,z]3and P:=x3+y3+z3Vfor the rest of
this subsection. We will write and instead of Pand Pto simplify notation.
By Theorem 6.2.3 and Proposition 6.2.4, we know that the quotient C[]0/C[N()]
exists and that it is a nontrivial GL3-module, so it decomposes as a direct sum of
irreducible GL3-modules. We will explicitly compute some of the corresponding mul-
tiplicities.
Note that this is a special case as the orbit closure Vis a normal variety.
Indeed, this follows because is the third secant variety of the 3-uple veronese em-
bedding P1P3, which is projectively normal by [IK99, Thm. 1.56]. This simplifies
the calculation because we need not determine the normalization of . Note that if P
is a generic regular cubic, is not normal [BI15, Cor. 3.17 (1)].
Pdefines the elliptic curve with j-invariant equal to zero [Stu93, 4.4.7,4.5.8]. Its
orbit closure is the hypersurface defined by the Aronhold invariant αC[V]4, see
[BI15, §3.2.1] for an explicit description. Thus, C[N()] = C[] = C[V]/α. We
write in general
SymdSymmC3=
λΛ
V(λ)p(d,m;λ),
the coefficients p(d,m;λ)are also known as plethysm coefficients. In this case, we
have m=3.
The Aaronhold invariant αis a degree 4 polynomial and up to scaling, the unique
highest weight vector in C[V]of weight (4,4,4)with respect to the action of GL(W)
=
GL3(C). This means that the linear span of the GL3-orbit of αis isomorphic to the
irreducible GL3-module V((4,4,4)). We denote by Λ:=Λ+
3the set of dominant
weights of GL3(C), see Paragraph A.2.8. The degree 4 part of the homogeneous ideal
generated by αthen decomposes as
αd=C[V]d4·α
=
λΛ
V(λ)p(d4,3; λ(4,4,4))
and we get
C[]d
=λΛV(λ)bλwhere bλ=p(d,3; λ)p(d4,3; λ(4,4,4)).
69
aλbλmλλ
1 1 0 (4,2,0)
2 1 1 (6,0,0)
1 1 0 (4,4,1)
1 1 0 (5,2,2)
2 1 1 (6,3,0)
2 1 1 (7,2,0)
1 0 1 (8,1,0)
3 1 2 (9,0,0)
1 1 0 (6,4,2)
2 1 1 (6,6,0)
1 1 0 (7,3,2)
1 1 0 (7,4,1)
1 0 1 (7,5,0)
1 1 0 (8,2,2)
1 0 1 (8,3,1)
3 1 2 (8,4,0)
1 0 1 (9,2,1)
3 1 2 (9,3,0)
4 1 3 (10,2,0)
2 0 2 (11,1,0)
4 1 3 (12,0,0)
1 1 0 (6,6,3)
1 1 0 (7,6,2)
1 1 0 (8,4,3)
1 1 0 (8,5,2)
2 1 1 (8,6,1)
1 0 1 (8,7,0)
2 2 0 (9,4,2)
1 0 1 (9,5,1)
4 1 3 (9,6,0)
2 1 1 (10,3,2)
3 1 2 (10,4,1)
4 1 3 (10,5,0)
2 1 1 (11,2,2)
2 0 2 (11,3,1)
5 1 4 (11,4,0)
2 0 2 (12,2,1)
6 1 5 (12,3,0)
6 1 5 (13,2,0)
4 0 4 (14,1,0)
5 1 4 (15,0,0)
1 1 0 (6,6,6)
1 1 0 (8,6,4)
aλbλmλλ
1 1 0 (8,8,2)
2 2 0 (9,6,3)
1 1 0 (9,7,2)
1 0 1 (9,8,1)
1 0 1 (9,9,0)
1 1 0 (10,4,4)
1 1 0 (10,5,3)
3 2 1 (10,6,2)
3 1 2 (10,7,1)
4 1 3 (10,8,0)
1 1 0 (11,4,3)
3 2 1 (11,5,2)
4 1 3 (11,6,1)
3 0 3 (11,7,0)
1 0 1 (12,3,3)
4 2 2 (12,4,2)
3 0 3 (12,5,1)
9 2 7 (12,6,0)
3 1 2 (13,3,2)
5 1 4 (13,4,1)
7 1 6 (13,5,0)
3 1 2 (14,2,2)
4 0 4 (14,3,1)
9 1 8 (14,4,0)
3 0 3 (15,2,1)
9 1 8 (15,3,0)
1 0 1 (16,1,1)
9 1 8 (16,2,0)
5 0 5 (17,1,0)
7 1 6 (18,0,0)
1 1 0 (9,6,6)
1 1 0 (9,8,4)
1 1 0 (10,6,5)
1 1 0 (10,7,4)
2 2 0 (10,8,3)
2 1 1 (10,9,2)
2 1 1 (10,10,1)
2 2 0 (11,6,4)
1 1 0 (11,7,3)
3 2 1 (11,8,2)
2 0 2 (11,9,1)
2 0 2 (11,10,0)
1 1 0 (12,5,4)
aλbλmλλ
4 3 1 (12,6,3)
4 2 2 (12,7,2)
5 1 4 (12,8,1)
6 1 5 (12,9,0)
1 1 0 (13,4,4)
2 1 1 (13,5,3)
6 3 3 (13,6,2)
5 1 4 (13,7,1)
8 1 7 (13,8,0)
3 1 2 (14,4,3)
5 2 3 (14,5,2)
8 1 7 (14,6,1)
9 1 8 (14,7,0)
1 0 1 (15,3,3)
6 2 4 (15,4,2)
6 0 6 (15,5,1)
13 2 11 (15,6,0)
5 1 4 (16,3,2)
8 1 7 (16,4,1)
12 1 11 (16,5,0)
4 1 3 (17,2,2)
6 0 6 (17,3,1)
13 1 12 (17,4,0)
5 0 5 (18,2,1)
13 1 12 (18,3,0)
1 0 1 (19,1,1)
12 1 11 (19,2,0)
8 0 8 (20,1,0)
8 1 7 (21,0,0)
1 1 0 (10,8,6)
1 1 0 (10,9,5)
1 1 0 (10,10,4)
1 1 0 (11,8,5)
1 1 0 (11,9,4)
1 1 0 (11,10,3)
2 2 0 (12,6,6)
1 1 0 (12,7,5)
3 3 0 (12,8,4)
3 2 1 (12,9,3)
4 2 2 (12,10,2)
2 0 2 (12,11,1)
4 1 3 (12,12,0)
2 2 0 (13,6,5)
aλbλmλλ
2 2 0 (13,7,4)
4 3 1 (13,8,3)
5 2 3 (13,9,2)
5 1 4 (13,10,1)
4 0 4 (13,11,0)
4 3 1 (14,6,4)
4 2 2 (14,7,3)
7 3 4 (14,8,2)
7 1 6 (14,9,1)
9 1 8 (14,10,0)
2 1 1 (15,5,4)
6 3 3 (15,6,3)
7 3 4 (15,7,2)
9 1 8 (15,8,1)
11 1 10 (15,9,0)
2 1 1 (16,4,4)
4 1 3 (16,5,3)
10 3 7 (16,6,2)
10 1 9 (16,7,1)
15 2 13 (16,8,0)
4 1 3 (17,4,3)
8 2 6 (17,5,2)
12 1 11 (17,6,1)
14 1 13 (17,7,0)
2 0 2 (18,3,3)
9 2 7 (18,4,2)
10 0 10 (18,5,1)
20 2 18 (18,6,0)
7 1 6 (19,3,2)
11 1 10 (19,4,1)
17 1 16 (19,5,0)
5 1 4 (20,2,2)
9 0 9 (20,3,1)
19 1 18 (20,4,0)
7 0 7 (21,2,1)
17 1 16 (21,3,0)
2 0 2 (22,1,1)
16 1 15 (22,2,0)
10 0 10 (23,1,0)
10 1 9 (24,0,0)
. . . . . . . . .
Figure 6.2.1: Multiplicities in C[]0/C[N()] for the Fermat cubic, up to degree 8.
70
Note that bλcan be computed with the software package [BKT12].
Denoting by aλthe coefficients such that C[]0=λΛV(λ)aλ, we are inter-
ested in the numbers mλ:=aλbλ, because:
C[]0/C[] = λΛV(λ)aλbλ.
It follows from the Peter-Weyl Theorem (Theorem A.2.7) that aλ=dim(V(λ)G),
where GGL3is the stabilizer of P. It is well-known [BI15, Prop. 2.4] that Gconsists
of permutation matrices and diagonal matrices whose diagonal entries are third roots
of unity. A matrix representation of the canonical projection V(λ)V(λ)Gwith
respect to the basis of semistandard Young tableaux (SSYT) is obtained by symmetriz-
ing each SSYT with respect to Gand straightening it [Ful97, § 7.4]. The quantity aλ
arises as the rank of this matrix. Using this method, we have computed the values of
the mλ:=aλbλup to degree 8, see Figure 6.2.1 on page 70.
6.2.7 Remark. A formula for aλis more involved than the one for bλ. Advancing
methods used in [BI11, Section 4.2] (see also [Ike12, Section 5.2]), Ikenmeyer [Ike16]
determined such a formula: For λΛ, denote by |λ|:=λ1+λ2+λ3the sum of its
entries. We have aλ=0 unless|λ|=3dfor some dN. In this case,
aλ=
µΛ
|µ|=d
ν1,...,νdΛ,
|νk|=3·k·ˆ
µk
for all k
cλ
ν1,...,νd·
d
k=1
p(ˆ
µk,3k;νk), (1)
where cλ
ν1,...,νddenotes the multi-Littlewood-Richardson coefficient and ˆ
µkdenotes the
number of times that kappears as an entry of µ.
6.3 Proof of Main Theorem
This section contains the proof of Theorem 6.2.3. We require results from Section A.3
and Subsection A.1.1.
Note that the stabilizer Gacts on the variety End(W)by multiplication from
the left. Since Vis a polynomial GL(W)-module, there is a well-defined morphism
ω: End(W),a Pa. For hGand aEnd(W),
ω(ha) = P(ha) = Pha=Pa=ω(a).
Therefore, ωis an G-invariant morphism. Since Gis a reductive algebraic group,
there is an affine quotient variety End(W)//Gtogether with a surjective morphism
71
π: End(W)End(W)//Gand ωfactors as a morphism φ: End(W)//Gsuch
that the following diagram commutes:
End(W)
π
ω
End(W)//G
φ
(2)
Furthermore, the variety End(W)//Gis a normal variety because End(W)is normal
[TY05, 27.5.1].
The morphism φis a birational map. Indeed, End(W)//Gcontains GL(W)//G
as an open subset [TY05, 27.5.2] and the restriction of φto this open subset is the
isomorphism GL(W)//G
=, [TY05, 25.4.6]. Since is an open subset of its closure
[TY05, 21.4.3], this proves that φis generically one to one.
The normalization ν: N()is a surjective, finite morphism of affine alge-
braic varieties [GW10, Proposition 12.43 and Corollary 12.52]. By the universal prop-
erty of the normalization [GW10, Corollary 12.45] there exists a unique morphism
ψ: End(W)//GN()which completes (2) to a commutative diagram:
End(W)
π
ω
End(W)//G
φ
ψN()
ν
(3)
The morphism ψis dominant and therefore corresponds to an injective ring ho-
momorphism
C[N()] C[End(W)//G] = C[End(W)]G= (C[GL(W)]0)G,
due to Lemma A.3.4. Taking G-invariants is with respect to the left action of Gon
C[GL(W)] and considering polynomial submodules is with respect to the right action
of GL(W)on C[GL(W)], so these two operations commute. Hence,
C[N()] (C[GL(W)]G)0=C[]0.
As a consequence, the polynomial part of the coordinate ring of Pcan be identi-
fied with the ring of G-invariants in C[End(W)], where Gis the stabilizer of P.
6.3.1 Remark. There is a commutative diagram of GL(W)-equivariant inclusions of C-
algebras:
C[End(W)] C[]
C[]0C[End(W)]G
C[N()]
(4)
72
Here, C[]and C[End(W)]Ghave the same quotient field K. The inclusion
C[N()] C[End(W)]G=C[]0
is an inclusion of integrally closed subrings of K. (By definition, C[N()] is the
integral closure of C[]in K.)
We now show that (1) of Theorem 6.2.3 implies PEnd(W) = . Recall (3). The
condition C[N()] = C[]0holds if and only if the morphism ψis an isomorphism.
In this case it follows that φis the normalization of . Thus, (1) implies in particular
that φis surjective and therefore ωis surjective, which means PEnd(W) = .
We now ask when the inclusion C[N()] C[]0becomes an equality. We
will require an auxiliary lemma for the proof. The algebraic group C×=GL1acts
polynomially on a variety Xif the action morphism C××XXlifts to a morphism
C×XX. We will denote this map by a dot, i.e., (t,x) t.x.
6.3.2 Lemma. Let Xand Ybe affine varieties, each of them equipped with polynomial
C×-actions admitting unique fixed points 0XXand 0YY, respectively. Let
φ:XYbe a C×-equivariant morphism. Then, φ1(0Y) = {0X}if and only if φis
finite.
Proof. The “only if part is [Lan15, Lemma 7.6.3]. For the converse, assume that φis
finite. Let xXbe such that φ(x) = 0Y. Then, φ(t.x) = t.φ(x) = t.0Y=0Yfor all
tC×and hence, C×.xφ1(0Y). But φ1(0Y)is a finite set, therefore C×.xis finite
and irreducible, i.e., a point. This implies C×.x={x}, so xis a fixpoint for the action
of C×. It follows that x=0Xby uniqueness of the fixpoint.
Lemma 6.3.2 will be applied to the morphism φ: End(W)//G. We therefore
study the action of the scalar matrices C×GL(W)on End(W)//Gand . Observe
that the morphism φis equivariant with respect to this action. We need to make sure
that both varieties have a unique fixpoint in order to make use of Lemma 6.3.2.
We first reduce to the case where Vhas a unique fixpoint under the action of all
scalar matrices. For this purpose, fix some basis of W, so GL(W)
=GLnand let
V=λNnV(λ)be the decomposition of Vinto isotypical components, i.e., V(λ)is
a direct sum of irreducible modules of type λ. Note that the only weights λthat
appear are in Nnbecause Vis a polynomial GLn-module. Let P=λNnPλbe the
corresponding decomposition of P, i.e., PλV(λ). Observe that the point ˜
P:=PP0
has the same stabilizer as P, because any element of V0is GL(W)-invariant. Let
˜
V:=λ=0V(λ)be the complement of V0in V. Then,
={P0}ט
PV0ט
V
=V
and consequently, P={P0}ט
P
=˜
P. This shows that we may henceforth assume
V=˜
Vand P=˜
P. In this situation, the origin 0VVis the only fixpoint under the
action of the scalar matrices. Consequently, it is also the only C×-fixpoint in .
73
On the other hand, End(W)also has a unique fixpoint with respect to the left
action by scalar matrices, namely the zero map which we will denote by 0. At this
point, we require the following lemma to deduce that End(W)//Galso has a unique
fixpoint:
6.3.3 Lemma. Let Ebe an affine variety on which C×acts polynomially with a unique
fixpoint 0. Assume that a reductive group Gacts on Efrom the left such that the
actions of Gand C×commute. Then, the quotient E//Galso has a unique fixpoint
under the induced action of C×.
The proof of this lemma is slightly technical and will be given afterwards. Using
it, we conclude that π(0)is the unique fixpoint in X:=End(W)//Gand 0Vis the
unique fixpoint in Y:=. The morphism φ:XYnow satisfies the conditions of
Lemma 6.3.2: We proceed to prove the equivalence of (1) and (2).
We first show (1)(2). If (1) holds, ψis an isomorphism and φis a normalization
of . Therefore, φis a finite morphism. By one direction of Lemma 6.3.2, this implies
that φ1(0V) = {π(0)}. In other words, φ(π(a)) = 0Vimplies π(a) = π(0). We
have Pa=ω(a) = φ(π(a)), so Pa=0Vimplies π(a) = π(0), which is the same
as saying that the zero map 0 is contained in the closure of the G-orbit of a. This is
precisely (2).
For the converse implication, we assume (2). For any aEnd(W), the condition
0V=φ(π(a)) = ω(a)implies 0 Gaby (2). By construction of the GIT quo-
tient, this implies π(a) = π(0)and hence, φ1(0V) = {π(0)}. The other direction
of Lemma 6.3.2 now states that φis a finite morphism. Any finite morphism is inte-
gral [GW10, Remark 12.10], so φis an integral birational map from a normal variety
End(W)//Gto . By [GW10, Proposition 12.44], it follows that it is the normalization
of , so ψis an isomorphism.
Proof of Lemma 6.3.3.We first note that 0 is a fixpoint for the action of Gas well. In-
deed, for any hGand any tC×, we have t.h.0 =h.t.0 =h.0 because the actions
commute, so h.0 is a fixpoint for the action of C×. By uniqueness, this implies h.0 =0.
As hGwas arbitrary, 0 is a fixpoint for the action of G.
We will denote by π:EE//Gthe quotient morphism. Assume that x=π(a)is
any fixpoint of the action of C×. Observe that
{x}=C×.x=C×.π(a) = π(C×.a),
so C×.aπ1(x). Since the action of C×is polynomial, the orbit map lifts to a
C×-equivariant morphism γ:CEwith γ(t) = t.afor tC×. Since
t.γ(0) = γ(t·0) = γ(0)
74
for all tC×, it follows that γ(0)is a C×-fixpoint in E, so γ(0) = 0. This implies that
0C×.a. Because 0 is a fixpoint for the action of G, the set{0}Eis a closed G-orbit.
By the nature of the GIT-quotient, points of Ethat share a closed G-orbit are mapped
to the same point in the quotient. In this case, aand 0 share the closed orbit{0}and
it follows that x=π(a) = π(0). Thus, π(0)is the only fixpoint of the action of C×
on E//G.
75
Chapter 7
Techniques for Boundary Classification
We now turn to the more general case that PC[W]dis not closed, so there exist
forms in Pthat are not of the form Pafor any aEnd(W). Even so, any
degeneration of Pcan be obtained by approximation, as we will explain in Section 7.1.
In several examples, some components of Pare orbit closures of degenerations
of P. In some cases, like the 3 ×3 determinant, every component of the boundary
contains a dense orbit see Chapter 8. In fact, we have seen one such example
in Lemma 6.2.5. If a degeneration Qof Pis given, we can check whether Qis a
component of Pby testing whether dim(GQ) = dim(GP) + 1. The latter implies
that Qis a proper, GP-invariant, codimension one subvariety of P, hence disjoint
from Pand consequently a component of P. In Section 7.2, we describe a way to
compute the dimension of GQby means of linear algebra.
We can classify the components of Pin some cases by constructing all of them
in this way. One way to ensure that all components have been found is to give a sharp
upper bound on the number of components of P. The upper bound we will use is
developped in Section 7.3.
7.1 Approximating Degenerations
Let C[t]be the polynomial ring over C. For an element qEnd(W)C[t], we write
q=K
k=1qktkwith qkEnd(W), i.e., qkis the k-th coefficient of q. On the other hand,
one can also think of qas an endomorphism of the free C[t]-module WC[t]. Any
form PC[W]can also be viewed as a polynomial with coefficients in C[t], giving
meaning to the notation Pq.
7.1.1 Proposition. Let PC[W]d. There exists a natural number KNsuch that for
every QC[W]d, we have QPif and only if there exists a qEnd(W)C[t]
such that PqtK·Q(mod tK+1).
Proof. We denote by CJtKthe ring of formal power series with complex coefficients.
Furthermore, we denote by C((t)) its fraction field, which consists of power series with
77
finitely many terms of negative degree. Both rings contain C[t]in a natural way. By
[LL89, Proposition 1 and the corollary on p. 11], there is a number MNwith the
following property: For every QP, there exists qEnd(W)C((t)) such that we
have PqQ(mod t)and tMqCJtK. We define K:=dM and claim that it has the
desired properties.
For the direction, assume that ω(q) = Pq=tKQ+k>KQktkwith cer-
tain polynomials QkC[W]d. Let (ti)iNbe a sequence of complex numbers that
converges to zero. For every iN, pick a d-th root ζiof tiand define the endo-
morphism ai:=ζK
iq(ti)End(W). Since Pis homogeneous of degree d, we have
PζK
iidW=ζdK
iP=tK
iPand we may conclude that
Pi:=Pai=tK
i·(Pq(ti)) = tK
i(tK
iQ+
k>K
Qktk
i)=Q+k1Qk+Ktk
i
zero sequence
.
Hence, (Pi)iNis a sequence of elements in Pwhich converges to Q. Theorem 3.3.3
implies that QP.
For the other direction, fix some QC[W]dand let q=
k=Mqktkbe such that
PqQ(mod t). We then define ¯
q:=K
k=0qkMtkand note that it is sufficient to
show
P¯
qtK·Q(mod tK+1). (1)
Recall that V=C[W]dand let E:=End(W). The orbit map ω:EVgiven by
ω(q) = Pqis homogeneous of degree d, hence ωC[E]dV. We can therefore
write ω=r
i=1ωiQiwith certain ωiC[E]dand QiV.
Let pCJtKbe such that tM·q=¯
q+tK+1p. Then,
ω(q) = ω(tM(¯
q+tK+1p))=tK·ω(¯
q+tK+1p)
=
r
i=1
tK·ωi(¯
q+tK+1p)·Qi
r
i=1
tK·ωi(¯
q)·Qi=tK·ω(¯
q) (mod t).
Hence, QPq=ω(q)tK·ω(¯
q) = tK·(P¯
q) (mod t). Multiplying this
equation by tKyields (1).
7.1.2 Remark. Note that for a particular Q, it can happen that the qfrom Proposi-
tion 7.1.1 is divisible by t, say q=trp. Then, we have
tKQPq=P(trp) = tdr ·(Pp) (mod tK+1)
so with k:=Kdr <K, we have tkQPp(mod tk+1).
78
7.1.3 Definition. Let P,QC[W]d. A power series qEnd(W)C[t]which satisfies
Pqtk·Q(mod tk+1)is called an approximation path from Pto Qand the num-
ber kis called its order. The minimum order of all approximation paths from Pto Q
is called the approximation order of Qwith respect to P.
The order of approximation of Pis defined to be smalles number Kthat can be
chosen in Proposition 7.1.1. It is also the maximum, taken over all QP, of the
approximation order of Qwith respect to P.
7.1.4 Example. We give an example from the well-known study of ternary cubic
forms, see [Kra85, I.7] for a complete classification. Compare also Lemma 6.2.5. We
consider the homogeneous form P=zx2y3z3C[x,y,z]3. It defines an irre-
ducible, nonsingular cubic curve and it is known that dim(GP) = 0. Let
q:=(1 0 0
0 0 0
0 0 0 )+(0 0 0
0 1 0
0 0 0 )·t+(0 0 0
0 0 0
0 0 1 )·t3=diag(1, t,t3)End(C3)C[t].
Then, we have k=3 and
Pq= (t3z)x2(ty)3(t3z)3=t3·(zx2y3)t9·z3
Note that Q:=zx2y3is the equation of a cusp, which is singular. In particular we
have Q/Pand one can compute dim(GQ) = 1 with the methods of Section 7.2. It
follows that Qis an irreducible component of P.
Since the set End(W)C[t]is entirely too large to choose from, we need more in-
formation on how to choose approximation paths. There are only two straightforward
observations we can make.
7.1.5 Remark. Let PC[W]d. An element QPhas approximation order zero
with respect to Pif and only if QPEnd(W). In particular, Pis closed if and only
if Phas order of approximation zero.
7.1.6 Remark. Let QPhave approximation order k1 and let q=K
i=0qitibe
an approximation path from Pto Qof order k. Then, Pq0=0, because it is the
coefficient of tin Pq. In other words, q0 AP.
We also ask the following question:
7.1.7 Question. Is the order of approximation of Prelated to the degree of the
unique SL(W)-invariant from Theorem 5.2.3?
If PC[W]dand QP, we say that Qis a linear degeneration of Pif there is
alinear approximation path from Pto Q, i.e., one of the form b+at with b AP. For
many cases we consider, only linear degenerations occur.
79
7.2 The Lie Algebra Action
The Lie algebra of an algebraic group Gis defined as the tangent space of Gat the
identity element 1 G. We denote it by Lie(G):=T1(G). It carries the structure of a
Lie algebra, but this will not be of major importance for our considerations.
There is a strong relationship between the identity component Gof an algebraic
group and its Lie algebra, c.f. [Hum98, 9] and [TY05, 23]. Moreover, there is the notion
of a Lie algebra action on a vector space, and the action of an algebraic group induces
an action of its Lie algebra as follows: The action of Gon a space Vis a morphism
ϱ:GGL(V)of algebraic groups. Taking the derivative of this morphism at the
identity yields a linear map Dϱ:Lie(G)Lie(GL(V))
=End(V). The action of
an element aLie(G)on vVreturns the vector Dϱ(a)(v)V. We also use the
common shorthands gl(V):=Lie(GL(V)) and gln:=Lie(GLn).
By Proposition A.1.3 Gis smooth, so we know that the dimension of Lie(G)as a
vector space is equal to the dimension of Gas a variety. We will describe the vector
space Lie(GP)explicitly in Corollary 7.2.3. Together with Theorem A.1.9, this also
provides a way to compute the dimension of P.
Recall the partial derivative wPof PC[W]din direction wWfrom Defini-
tion 5.1.1.
7.2.1 Proposition. Let V=C[W]dand let GL(W)act on Vby precomposition. Then,
the induced action of gl(W) = Lie(GL(W)) = End(W) = WWon Vis given by:
V×gl(W) V
(P,a) Pa
where P(yw) = y·wPfor rank one tensors.
Proof. Let ϱ: GL(W)GL(V)be the morphism of algebraic groups corresponding
to the action of GL(W)on V. We have to compute the differential Dϱ:gl(W)gl(V).
Define g(t):=idW+ (ty w), it satisfies g(0) = idWand g(0) = yw. Consider a
symmetric power xdSymdW=Vwith xWand observe
ϱ(g(t))(xd) = xdg(t) = xd(idW+ (ty w)) = (x+t·x(w)·y)d.
The coefficient of tin this expression is Dϱ(yw)(xd). Expanding the right hand
side, we can see that the coefficient of tis equal to d·y·x(w)·xd1=y·wxd. Since
symmetric powers span V, the result follows.
80
7.2.2 Corollary. If we choose coordinates C[W] = C[x1, . . . , xn], this gives an action of
GLnon V. The corresponding action of gln=Lie(GLn)in coordinates is given by
V×gln V
(P,a)
n
i=1
n
j=1
aij ·xj·iP
where iPdenotes the partial derivative of Pwith respect to xi.
Proof. After choosing coordinates, a matrix a= (aij)Cn×n
=glncorresponds to the
tensor a=n
i=1n
j=1aij ·(xjei), where eiWis the dual basis vector of xiW.
The result follows from Proposition 7.2.1.
7.2.3 Corollary. Let V=C[x1, . . . , xn]dand let GLnact on Vby precomposition. For
PV, we have
Lie(GP)
={aCn×n
n
j=1
n
i=1(aij ·xj·iP)=0}
Proof. This follows from Corollary 7.2.2 and the fact that Lie(GP)is the set of all
agl(W)that send Pto zero, see [TY05, 25.1.3].
7.2.4 Example. Let W:=Cd×d, so detdC[W]d. Let H:=G
detdbe the identity
component of the stabilizer of detd. We want to study Lie(H). For a square matrix
AW, we denote by tr(A):=d
r=1Arr its trace.
There is a bilinear map W×WEnd(W)defined by (A,B) AB, where
ABdenotes the map C ACBt. It is straightforward to check that this induces an
isomorphism WW
=End(W). Let Eij Wbe the matrix that has the entry 1 in
position (i,j)and zeros elsewhere. These matrices form a basis of Wand the matrix
representation of ABwith respect to this basis is the Kronecker product
(AB)(ij),(rs)= (AB)(Ers)ij = (A·Ers ·Bt)ij =AirBt
sj =AirBjs.
Let xij Wbe the dual vector of Eij W, so that C[W]is a polynomial ring
in the variables xij. Then detd=det(x)where x= (xij)is the matrix containing all
the variables. We denote by xthe adjugate of the matrix x, i.e., the transpose of the
cofactor matrix of x. By definition, x
sr =rs detd. By Corollary 7.2.2, the action of
ABEnd(W) = gl(W)on the form detdis given by
i,j
r,s
(AB)(ij),(rs)·xij ·x
sr =
i,j,r,s
At
rixijBjsx
sr =tr(AtxBx).
81
In summary, we have ABLie(H)if and only if tr(AtxBx) = 0 for all xW.
Since GLdWis dense and S=det(S)·S1for all SGLd, we conclude:
ABLie(H) SGLd: tr(AtSBS1) = 0. (2)
Theorem 3.4.1 states that there is a surjective homomorphism of algebraic groups
SLd×SLdH, which induces sld×sldLie(H)
(A,B) AB(A,B) (AI) + (IB).
Note that sld:=Lie(SLd)is equal to the vector space of d×dmatrices whose trace
vanishes [TY05, 19.1]. Clearly, (AI) + (IB)satisfies (2) when Aand Bare trace-
less, but it is not immediately obvious that any sum r
i=1AiBisatisfying (2) must
be of the form (AI) + (IB)with Aand Btraceless.
7.3 Resolving the Rational Orbit Map
Recall the rational map ϖPfrom (1) in Section 5.3. We denote by dom(ϖP)its domain,
i.e., the maximal open subset of PEnd(W)on which ϖis defined. There is a well-
known classical way to resolve the indeterminacies of a rational map: One considers
the graph
Γ:=Γ(ϖP) ={(a,ϖP(a))adom(ϖ)}PEnd(W)×PP
which has two natural morphisms β:ΓPEnd(W)and γ:ΓPPinduced by
the projections. By definition, Γis a projective variety. Since γis a dominant projective
morphism, it is surjective. With a good understanding of Γ, we could deduce a lot of
information about PP.
Unfortunately, while PEnd(W)is just a projective space, the variety Γand the
morphism βare not well-understood in general. A priori, we only know that βis a
blowup [Har06, II.7.17.3]. This is not very informative because any birational projec-
tive morphism is a blowup, see [Har06, II.7.17].
We want to give a minimal treatment here, even though one can define the blowup
of any scheme in a closed subscheme, see [GW10, pp. 406] and [Har06, pp. 160]. For
a classical treatment in the language of varieties, see [Har95, pp. 80]. We do require a
hint of scheme language but want to avoid the technical overhead: Hence, we restrict
to the affine case and heavily rely on the fact that blowups are local: See [GW10,
Prop. 13.91(2)] and use that open immersions are flat [GW10, Prop. 14.3(4)].
82
7.3.1 Definition. Let Xbe an affine complex variety and IC[X]an ideal. It cor-
responds uniquely to a subscheme ˆ
A:=Spec(C[X]/I)of X. Choose generators
I=φ0, . . . , φrand consider the rational map
φ:X Pr
x [φ0(x): . . . : φr(x)]
We denote by Γ:=Γ(φ)the closure of{(x,φ(x)) |xdom(φ)}X×Pr, it is called
the graph of φ. The projection β:ΓXis called the blowup of Xwith center ˆ
A. Up
to an automorphism of Γ, this does not depend on the choice of the generators of I,
see [Hü12, Prop. 1.19].
Remark. We will usually denote by A Xthe subvariety of Xcorresponding to the
radical ideal I, i.e., Ais the support of ˆ
A.
7.3.2 Definition. In line with the above definition, we denote by ˆ
APEnd(W)the
closed subscheme of End(W)which corresponds to the ideal generated by the coef-
ficients of Pa. Furthermore, we denote by ˆ
Ass
Pthe closed subscheme of the variety
End(W)ss which is defined locally by the same equations as ˆ
AP.
Note that ˆ
APis supported on AP={aEnd(W)|Pa=0}. The graph of ϖPis
the projectivization of the blowup of End(W)with center ˆ
AP.
7.3.3 Example. In general, we do not have AP=ˆ
AP, i.e., the scheme ˆ
APis not
reduced. Consider W=C3and let PC[W]3be any regular cubic. The affine
cone Z(P)Wis an irreducible surface of degree 3, therefore it cannot contain any
linear space of dimension two. It follows that the maximal linear subspaces of Z(P)
are the lines contained in this cone and any nonzero a APis a rank one linear map
whose image is spanned by a point wZ(P). Hence, under the canonical isomor-
phism E:=End(W) = WW, we have a=wxfor some xW. The projective
class of ais a point in the segre embedding
P2×P2
=PW×PWP(WW) = PE
whose first coordinate is a point on the projective smooth curve defined by P, i.e., we
have PAP=PZ(P)×P2. In particular, PAPis a smooth variety.
However, ˆ
APis not even a variety: On the one hand, the coefficients of Paare
homogeneous polynomials of degree 3, so the ideal IC[End(W)] generated by
them cannot contain any degree two polynomial. On the other hand, let µC[E]2
be any 2 ×2 minor, then µvanishes on AP=Z(I)and therefore µI. Since I
contains no element of degree 2, we also have µ/I. Hence, I=I, so ˆ
AP=AP.
Note also that passing to semistable points does not change this fact: In this par-
ticular example, the stabilizer of Pis a finite group by Lemma 6.2.5 and so every point
is semistable. Thus, Ass
P=ˆ
Ass
P.
83
7.3.4 Remark. Example 7.3.3 generalizes as follows: Let n:=dim(W)and PC[W]d.
The hypersurface Z(P)cannot contain a linear subspace of dimension n1, so any
endomorphism a APsatisfies rk(a)<n1. Therefore, the ideal of ˆ
APcontains
some power of the (n1)×(n1)minors, which are polynomials of degree n1.
The coefficients of Paare homogeneous polynomials of degree din the entries of a,
so the ideal generated by them is not radical when dn.
If Pis generic, then Z(P)is smooth. Assuming d>1, it can be shown [Sha94,
Exercise II.1.13] that every linear subspace of Z(P)has dimension at most n
2, so ˆ
APis
nonreduced already when dn
2.
7.3.5. For the rest of this section, we set E:=PEnd(W)and G:=PGL(W)for brevity.
A modest invariant of the boundary Pis its number of irreducible components.
We are specifically interested in giving an upper bound on this number. We have the
following commutative diagram:
EϖPPP
Γ
β
γ
Define U:=β1(G) = γ1(PP)and let Z:=Γ\Ube its complement.
Claim. The number of irreducible components of Zis an upper bound on the number
of irreducible components of P.
Proof of Claim. By commutativity, U=β1(G) = γ1(PP)and the restriction of γ
to Uis an isomorphism onto PP. Since γis also surjective, it follows that
γ(Z) = γ(Γ\U) = γ(Γ\γ1(PP)) = PP\PP=PP. (3)
Since PPhas the same number of irreducible components as P, the statement
follows.
Unfortunately, this number does not seem easier to estimate than the number of
components of Pitself. We will see later that passing to semistable points helps
tremendously, which is what we will discuss next.
We give V:=End(W)C[W]daGP-module structure by acting trivially on the
right tensor factor. The graph Γof ϖPadmits the canonical embedding
Γ:={([a],[Pa]) |a/ AP}PEnd(W)×PPP(V)(4)
via the Segre map, see [Lan12, Def. 4.3.4.1] and the following discussion.
7.3.6 Lemma. The embedding (4) realizes Γas a GP-invariant subvariety of P(V).
With the language of Subsection A.1.2, it turns Γinto a linearized GP-variety.
Furthermore, Γss is the graph of the restriction of ϖPto PEnd(W)ss.
84
Proof. We define Γ:={([a],[Pa]) |a/ AP}. It is an open, dense, GP-invariant subset
of Γand therefore, Γ=Γis closed and GP-invariant.
For the second claim, first note that a tuple ([a],[Q]) Γis semistable if and
only if aEnd(W)ss by definition of the group action. Hence, Γss is the intersection
of PEnd(W)ss ×PPwith Γ. Since Γss also contains Γby Proposition 5.3.4, an
elementary topological argument yields the statement.
7.3.7 Definition. A blowup of a smooth variety with a smooth irreducible center of
codimension at least two is called a smooth blowup.
Remark. The condition on the codimension is only to avoid pathologies. If the center
has codimension one, the blowup is an isomorphism [Har06, II.7.14]. Note also that
the center of a smooth blowup is always reduced, i.e., a variety [Eis94, Cor. 10.14].
The following is our main tool for bounding the number of components of P.
Restricting to semistable points will allow us to actually verify the assumption in
some applications.
7.3.8 Proposition. Let β:Γss (PEnd(W))ss be the projection from the graph of the
restriction of ϖPto semistable points. If βfactors as a sequence
Γss =Yk
βk
··· β2
Y1β1
Ess
of ksmooth blowups, then Phas at most k+1 irreducible components.
The proof of Proposition 7.3.8 will finish this section. We will require the following
well-known result which can be found in [Har06, II.8.24]. Intuitively, it states that a
smooth blowup only spawns a single irreducible component.
7.3.9 Proposition. Let β:ΓXbe a smooth blowup with center A. Then, Γis a
smooth variety and β1(A)is a smooth, irreducible, codimension one subvariety of Γ
outside of which βis an isomorphism.
Note also that Proposition 7.3.8 would be easier to verify if we replaced Ess by E,
because the latter is projective. Since Ess is usually not projective, neither is Γss and we
have to pass to a quotient to regain this property. We will discuss this process before
proceeding to prove Proposition 7.3.8.
7.3.10. Clearly, β:ΓEis GP-equivariant and γ:ΓPPis GP-invariant, in
particular there is a unique morphism ˜
γ:Γss//GPPPsuch that
Γss
γ
β
π
Ess
ϖP
Γss//GP˜
γPP
(5)
commutes, by Proposition A.1.11.
85
7.3.11 Lemma. The morphism ˜
γin (5) is a birational, surjective morphism and its
restriction to ˜
U:=˜
γ1(PP)is an isomorphism ˜
U
=PP.
Proof. Since ϖPand βare dominant, so is ˜
γ. The variety Γss//GPis projective, there-
fore ˜
γhas closed image, so it is surjective. The orbit map ϖPrestricts to a surjective
morphism PGL(W)PP, and its graph is the open subset
U={([g],[Pg]) |gGL(W)}Γss.
Hence, βrestricts to am isomorphism U
=PGL(W)of GP-varieties. We note here
that ˜
U=π(U)and ˜
γ(˜
U) = γ(U) = PP, so ˜
γ:˜
UPPis a well-defined surjecive
morphism. As PPis smooth, ˜
γis an isomorphism if and only if it is injective, by
Zariski’s Main Theorem [TY05, 17.4.6]. Let [Q]PP, then we are left to show that
the fiber ˜
γ1([Q]) contains exactly one element. To see this, let gGL(W)be such
that Q=Pg. Then,
˜
γ1([Q]) = π(β1(ϖ1([Q]))) = π(β1(GP[g])).
Since βis an equivariant isomorphism, β1(GP[g]) is an orbit. Since πis constant
on orbits, we are done.
Proof of Proposition 7.3.8.We define ˜
Γ:=Γss//GPand consider the diagram (5). We
recall that G=PGL(W)PEnd(W) = Eand U=γ1(PP) = β1(G).
Set Z:=Γss \U. By Lemma 7.3.11, the morphism ˜
γ:˜
ΓPPis birational,
surjective, and restricts to an isomorphism on ˜
U=˜
γ1(PP) = π(U). We also
define the set ˜
Z:=π(Z).
Claim. We have ˜
γ(˜
Z) = PP.
Proof of Claim. As every GP-orbit in U
=Gis closed, Property (G4) of the good quo-
tient πimplies that ˜
Uand ˜
Zare disjoint. Since πis surjective, ˜
Zis the complement
of ˜
Uin ˜
Γ. The claim follows as in (3) because ˜
γis surjective by Lemma 7.3.11.
As PP=˜
γ(˜
Z) = ( ˜
γπ)(Z)isthe image of Zunder a morphism, we are left to
verify that Zhas at most k+1 irreducible components.
Recall that βis a composition
Γss =Yk
βk
··· β2
Y1β1
Y0=Ess
of ksmooth blowups. We set ˆ
βi:=β1. . . βifor all 1 ik. Furthermore, denote
by Z0:=Ess \Gthe set of projective classes of semistable endomorphisms that are
noninvertible. Since Ess is open in E, it follows that Z0is an irreducible hypersurface
in Ess. Let Zi:=ˆ
β1
i(Z0)for 1 ik. Since the complement of ZkYk=Γss is
equal to U=β1(G), we have Z=Zk. We prove by induction on ithat Zihas at most
86
i+1 irreducible components, which is tautological for i=0 and proves our initial
claim for i=k. For the induction step, we assume 1 ikand that Zi1has at
most iirreducible components.
Claim. We have AiZi1Yi1, where Aiis the center of βi:YiYi1.
Proof of Claim. Assume for contradiction that Aiintersects ˆ
β1
i1(G). Since Aiis of
codimension two and β1
i(Ai)is of codimension one by Proposition 7.3.9, the Fiber
Dimension Theorem [TY05, 15.5.4] implies that there is a point gGsuch that ˆ
β1
i(g)
has positive dimension. As all βjare surjective, this would mean that β1(g)has
positive dimension, which is a contradiction.
Since AiZi1, we get that
Zi=β1
i(Zi1) = β1
i(Zi1\Ai)β1
i(Ai).
By Proposition 7.3.9,β1
i(Ai)is irreducible and β1
i(Zi1\Ai)
=Zi1\Aihas at
most iirreducible components. Therefore, Zihas at most i+1 irreducible compo-
nents.
7.3.12 Remark. One can prove an even more refined statement than Proposition 7.3.8.
Under the assumptions of the proposition, Z:=Γss \β1(G)is a GP-invariant closed
subset of Γss. The action of GPon Γss therefore permutes the finite set of irreducible
components of Z. Denoting by the number of orbits of this action, it follows from
the proof of Proposition 7.3.8 that the number of irreducible components of Pis at
most +1.
87
Chapter 8
The 3by 3 Determinant Polynomial
As outlined in Chapter 3, the orbit closure and boundary of the determinant poly-
nomial is of particular interest for GCT. Yet, very few explicit results describing the
geometry are known in low dimension. In this chapter we give a description of the
boundary of the orbit of the 3 ×3 determinant. These results have been previously
published in [HL16].
We view det3as the polynomial
det3:=det (x1x2x3
x4x5x6
x7x8x9)C[x1, . . . , x9]3,
a homogeneous polynomial of degree 3 on the space W:=C3×3. As before, we denote
by V:=C[W]3the space of all homogeneous forms of degree 3 on W.
Our main result is a description of (det3)that answers a question of Landsberg
[Lan15, Problem 5.4]: The two known components are the only ones. In Section 8.1 we
explain the construction of the two components. Our contribution lies in Section 8.2
where we show that there is no other component.
8.0.1 Theorem. The boundary (det3)has exactly two irreducible components:
The orbit closure of the determinant of the generic traceless matrix, namely
Q1:=det
x1x2x3
x4x5x6
x7x8x1x5
;
The orbit closure of the universal homogeneous polynomial of degree two in three
variables, namely
Q2:=x4·x2
1+x5·x2
2+x6·x2
3+x7·x1x2+x8·x2x3+x9·x1x3.
Remark. The two components are different in nature: (Q1)det3End(W)is the
orbit closure of a polynomial in only eight variables; the second component is more
subtle and is not contained in det3End(W). This component has analogues in higher
dimension and some results are known about them [LMR13].
89
8.1 Construction of Two Components of the Boundary
We denote by ω: End(W)(det3)the orbit map ω(a) = det3a. Recall also the
description of Gdet3from Theorem 3.4.1.
8.1.1 Lemma. We have dim((det3)) = 65 and dim((Q1)) = dim((Q2)) = 64.
Proof. The stabilizer Gdet3has dimension 16 by Theorem 3.4.1, thereby it follows
that dim(det3) = 81 16 =65 by Theorem A.1.9.
The dimension of (Qi)for i{1, 2}can be deduced from its Lie algebra. By
Corollary 7.2.3, this amounts to computing the rank of a 165 ×81 matrix, which is
easy using a computer, see Program 8.1.
1from sympy import *
2
3def OrbitDimension(P,X):
4P = Poly(P,X)
5Df = [ Poly(x)*diff(P,y) for xin Xfor yin X ]
6Mn = list(set(sum((Poly(Q).monoms() for Qin Df),[])))
7return Matrix([[ Q.coeff_monomial(m) for Qin Df ]
8for min Mn ]).rank()
9
10 x,y,z,a,b,c,d,e,f = X = symbols(xyzabcdef)
11
12 Q1 = Matrix([[x,y,z],[a,b,c],[d,e,-x-b]]).det()
13 Q2 = a*x**2+b*y**2+c*z**2+d*x*y+e*y*z+f*z*x
14
15 print(OrbitDimension(Q1,X))
16 print(OrbitDimension(Q2,X))
Program 8.1: Stabilizer computation with Python [Pyt;Sym].
8.1.2 Lemma. (Q1)is an irreducible component of (det3).
Proof. Since det3is concise and Q1is not, we have (Q1)(det3). Lemma 8.1.1
implies that (Q1)has codimension one in (det3)and must therefore be an irre-
ducible component of (det3).
8.1.3 Lemma. (Q2)is an irreducible component of (det3), distinct from (Q1).
Proof. It is easy to see that Q2is concise whereas Q1is not, so (Q1)contains no
concise polynomial, but (Q2)does. It follows that the two orbit closures are distinct.
Let
b:=(0x1x2
x10x3
x2x30)and a:=(2x6x8x9
x82x5x7
x9x72x4).
90
Th entries of these matrices are linear forms on W, so a,bEnd(W)with bpro-
jecting onto the space of antisymmetric matrices and aprojecting onto its orthogonal
complement of symmetric matrices.
Recall Section 7.1. To show that Q2(det3), we will use the approximation
path b+at as outlined. It is clear that b Adet3. The coefficient of tin det(b+ta)
is equal to tr(ba)by Jacobi’s formula, where bis the adjugate matrix of b. Fur-
thermore, we have b=utuwith u= (x3,x2,x1). Since tr(ba) = uaut=2Q2, we
have [Q2]P(det3)by Proposition 7.1.1. Lemma 8.1.1 implies that (Q2)has the
same dimension as (det3), so it is one of its irreducible components.
Note that Lemma 8.1.3 generalizes to higher dimensions: the limit of the deter-
minant on the space of skew-symmetric matrices always leads to a component of the
boundary of the orbit of detd, when d3 is odd, as shown by Landsberg, Manivel,
and Ressayre [LMR13, Prop. 3.5.1].
8.2 There Are Only Two Components
Throughout this section, we will denote by G:=Gdet3the stabilizer group of det3.
Recall the contents of Section 5.3. The annihilator A:=Adet3is precisely known,
thanks to the classification of the maximal linear subspaces of Wcontaining only
singular matrices [Atk83;FLR85;EH88]. These spaces are precisely the maximal
linear subspaces of Z(det3), see also Remark 5.3.2.
For every a A, there is a hGsuch that im(ha)is a subset of one of the
following spaces of singular matrices:
(
000),( 0
0
0),(0 0
0 0
)and (0αβ
α0γ
βγ0)for α,β,γC.
The first three are called compression spaces, we denote them by L1,L2and L3. The
fourth is the space of 3 ×3 skew-symmetric matrices, which we will denote by L0. Let
us write
End(W,L):={aEnd(W)|im(a)L}(1)
for a linear subspace LW. The sets Ei:=GEnd(W,Li)constitute the four
irreducible components of A. For example,
E0={[a]P(E)|S,TSL3: im(a)S·L0·T}.
Indeed, they are irreducible because they are the image of the irreducible variety
G×End(W,Li)under the action morphism. Also, the Eiare not contained in one
another and Ais their union.
We denote by Ess
ithe set of semistable endomorphisms in Ei.
91
8.2.1 Lemma. We have Ass =Ess
0=.
Proof. Let gt:=diag(t,t,t2), then gtSL3for all tCand
gt·(
000),( 0
0
0)·gtand g1
t·(0 0
0 0
)·g1
t
all tend to 0 when t0, for any constants . This proves that the first three compo-
nents do not meet End(W)ss. To show that Ass is not empty, we exhibit an invariant
function fC[End(W)]G, which does not vanish at a point bEnd(W,L0). Pick any
three matrices A,B,CWand let ˜
f: End(W)Cbe the regular function
˜
f(a):=tr(a(A)·a(B)·a(C)·a(A+B+C)).
The dot here is multiplication of 3 ×3 matrices. This is a G-invariant polynomial of
degree 6. Indeed, if hGis the map A SAT for some S,TSL3, then
˜
f(ha) = tr(S·a(A)·T·T·a(B)·S·S·a(C)·T·T·a(A+B+C)·S)=˜
f(a)
because S=S1and T=T1. It follows that f:=˜
f+ ( ˜
fτ)is G-invariant,
where τEnd(W)is the transposition map τ(A):=At. We now provide a point
bEnd(W)with f(b)=0. Let bbe the projection b:WL0which has the following
description in coordinates:
b=(0x1x2
x10x3
x2x30). (2)
For generic choices A,B,CW, a simple computation shows that f(b)=0.
1from sympy import Matrix
2
3A = Matrix([[1,1,1],[1,1,1],[1,1,1]])
4B = Matrix([[1,0,1],[0,1,0],[1,0,1]])
5C = Matrix([[0,0,1],[0,1,0],[1,0,0]])
6
7b = lambda A: (lambda x: Matrix(
8[[ 0 , x[0], -x[1] ],
9[ -x[0], 0 , x[2] ],
10 [ x[1], -x[2], 0 ]]) )(list(A))
11
12 t = lambda a: ( a(A) *(a(B).adjugate()) *a(C) *
13 (a(A+B+C).adjugate()) ).trace()
14 f = lambda a: t(a) + t(lambda A: a(A).transpose())
15 print(f(b))
The above Python program [Pyt;Sym] evaluates fon bfor a particular choice of
matrices A,Band Cgiving f(b) = 2.
92
8.2.2 Lemma. An endomorphism a Ass satisfies rk(a)3.
Proof. In [BD06, Thm. 2 and discussion above it], Bürgin and Draisma show that
any 2-dimensional subspace of Econtaining only singular matrices is contained in a
compression space. Therefore, if the image of ahad dimension 2 or less, then awould
lie in the nullcone, which is contrary to the choice of a.
Recall Section 7.3, in particular Definition 7.3.1. We set ˆ
Ass :=ˆ
Ass
det3, so Pˆ
Ass
are the indeterminacies of ϖ:PEnd(W)P(det3). The following proposition in
conjuction with Proposition 7.3.8 implies that (det3)has at most two irreducible
components, both of which we have described in Section 8.1. This finishes the proof
of Theorem 8.0.1.
8.2.3 Proposition. The subscheme Pˆ
Ass PEnd(W)is a smooth subvariety. In par-
ticular, the projection β:Γss PEnd(W)ss is a smooth blowup, where Γss denotes
the graph of ϖrestricted to PEnd(W)ss.
Proof. We set E:=End(W)to shorten notation. By Definition 7.3.1 and Lemma 7.3.6,
Γss is the blowup of PEss along Pˆ
Ass. Therefore, we have to check that Pˆ
Ass is
irreducible and smooth, the latter also implies that it is reduced, i.e., ˆ
Ass =Ass.
Since scalar matrices stabilize any point in P(V), the stabilizer of [det3]P(V)is
the group H:=GCidWGL(W).
Claim. Let b Ass be the point defined in (2). We claim Ass =HbGL(W), the
orbit of bunder the action of H×GL(W)by multiplication from left and right.
Proof of Claim. For ”, note that the left hand side is invariant under both actions
and contains b. Conversely, let a Ass. By Lemma 8.2.1 and because L0is invariant
under transposition, we may assume that the image of ais included in L0, up to
replacing aby another point in its orbit Ha. By Lemma 8.2.2, we also know that
rk(a)3, so asurjects onto L0. This implies that there is some gGL(W)such
that a=bg, and thus aHbGL(W).
In particular, Ass is irreducible, and smooth by Theorem A.1.9.(1). We will now
show that ˆ
Ass is also smooth. Since ˆ
Ass is invariant under the actions of both H
and GL(W)and its support Ass is an orbit under the same action, it suffices to verify
that ˆ
Ass is smooth at one single point, say b. This amounts to checking that the
dimension of the tangent space Tbˆ
Ass equals the dimension of Ass.
Recall that ˆ
Ais the scheme given by the coefficients of det3a=0, as polynomials
in the entries of a. By the Jacobian criterion [EH00, §V.3], we obtain the description
Tbˆ
Ass
={cEt2divides det3(b+tc)},
93
where tis a formal variable. The dimension of Tbˆ
Ass can be determined by means of
Program 8.2: It is equal to 35.
To calculate the dimension of Ass, we use the fact that it is an orbit under the
action of H×GL(W). More precisely, we consider the derivative of the orbit map
H×GL(W) Ass
(a,c) abc
at the neutral element, yielding a surjective linear map
Lie(H)×gl(W) TbAss
(a,c) ab +bc
We have gl(W)
=E. Recall the Lie algebra of Gfrom Example 7.2.4. Similar to that
example, we can check that the Lie algebra of His given by the maps (SI) + (IT)
for S,TW i.e., there is no condition on Sand Tto be traceless because Hcontains
scalar matrices. We can therefore express the tangent space as
TbAss ={ab +bc |aLie(H),cgl(W)}TbE
=E
={(SI)b+ (IT)b+bc|S,TW,cE}.
In other words, TbAss consists of all maps of the form
W W
C S·b(C) + b(C)·T+b(c(C))
for certain cEand S,TW. Program 8.2 also verifies that this space has dimen-
sion 35, which terminates the proof.
94
1from sympy import *
2from sympy.abc import t
3
4b = lambda A: (lambda x: Matrix(
5[[ 0 , x[0], -x[1] ],
6[ -x[0], 0 , x[2] ],
7[ x[1], -x[2], 0 ]]) )(list(A))
8
9generic_matrix = lambda v,n: ( lambda s:
10 Matrix([ s[i::n] for iin range(n) ]) )(
11 symbols(’%s:%d’%(v,n*n)) )
12
13 A = generic_matrix(’a’,3)
14 c = generic_matrix(’c’,9)
15
16 cA = list(c*Matrix(list(A)))
17 cA = Matrix( [cA[i::3] for iin range(3)] )
18 cV = list(c)
19
20 D = (b(A) + t *cA).det()
21 L = Poly( diff(D,t).subs(t,0), list(A) ).coeffs()
22 L = [ collect(l,cV) for lin L ]
23 M = Matrix([[ term.coeff(i) for iin cV ] for term in L ])
24
25 # dim(End(W)) = 81
26 print("tangent space of indeterminacy:", 81-M.rank())
27
28 S = generic_matrix(’s’,3)
29 T = generic_matrix(’t’,3)
30 V = list(S) + list(T) + cV
31 B = S*b(A) + b(A)*T + b(cA)
32 L = [t for lin list(B) for tin Poly(l,list(A)).coeffs()]
33 N = Matrix([[ term.coeff(i) for iin V ] for term in L ])
34
35 print("tangent space of HxGL(W)-orbit:", N.rank())
Program 8.2: Tangent space computation with Python [Pyt;Sym].
95
8.3 The Traceless Determinant
A computation as in Lemma 8.1.2 shows that for d{3,4,5}, the orbit closure of the
traceless d×ddeterminant is a component of (detd). We will show in this section
that it is true for all d3. The main work in proving this claim is the following
description of the stabilizer of the traceless determinant. In fact, its dimension is
the only information required, but a complete description is certainly of independent
interest.
8.3.1 Theorem. Let W={ACd×dtr(A) = 0}with d3 and let PC[W]dbe
the restriction of detdto W. The stabilizer group GPis reductive of dimension d21.
Moreover,
(1) The identity component of the stabilizer of Pis the group
G
P={gGL(W)SSLd:AW:g(A) = SAS1}.
(2) Let GL(W)be the group generated by
the transposition map t:WW,t(A) = At
and all ζ·idWwhere ζCis a d-th root of unity.
Then, GP=·G
Pand GPhas||=2dconnected components.
The proof is postponed briefly in order to first draw the announced conclusion.
8.3.2 Corollary. The orbit closure of the traceless determinant is an irreducible com-
ponent of (detd), for all d3.
For the proof of this corollary, we also require the following technical lemma,
which we will prove in direct succession.
8.3.3 Lemma. Let M
=Cn,DC[M]da concise form and WMa linear subspace
of codimension k. Let aEnd(M,W)be any projection onto W, let Q:=Daand
denote by PC[W]dthe restriction of Qto W. Then, dim(GQ) = dim(GP) + kn.
Proof of Corollary 8.3.2.We set M:=Cd×d,D:=detdC[M]dand W:=ker(tr). We
use the notation in Lemma 8.3.3, where k=1 and n=d2. By Theorem 8.3.1 we have
dim(GP) = n1 and Lemma 8.3.3 states that dim(GQ) = n1+n=2n1. By
Theorems A.1.9 and 3.4.1 this implies dim(Q) = dim(detd)1.
Proof of Lemma 8.3.3.We can choose coordinates C[M] = C[x1, . . . , xn]such that the xi
with inkare coordinates on W. Then, Qis simply the polynomial that arises
from Dby setting xito zero for i>nk, in the chosen coordinates. By Corollary 7.2.3,
Lie(GQ) ={aCn×n
n
i=1
n
j=1
aijxjiQ=0}={aCn×n
nk
i=1
n
j=1
aijxjiQ=0}
96
because iQ=0 for i>nk. It suffices to show that dim(Lie(GQ)) = dim(GP) + kn.
We have
dim({aCn×n
nk
i=1
nk
j=1
aijxjiQ=0}) (3)
=dim({aC(nk)×(nk)
n1
i=1
n1
j=1
aijxjiP=0})+ (n2(nk)2)
=dim(Lie(GP)) + 2kn k2
We will show thatxjiQ1ink<jn,Chas dimension k(nk)and trivial
intersection with xjiQ|1i,jnkC. This implies that Lie(GQ)arises from (3)
by imposing k(nk) = kn k2additional linearly independent conditions, dropping
the dimension from dim(GP) + 2kn k2to dim(GP) + kn as claimed.
The spaces intersect trivially because all monomials of polynomials in the second
space are divisibly by some xjwith jnkand none of the monomials of polyno-
mials in the first space are.
As ais the identity on W, we have iQ=iDfor inkand the iDare linearly
independent by Proposition 5.1.2 because Dis concise. It follows that the xjiQwith
1inkand nk<jnare all linearly independent.
8.3.1 Proof of Theorem 8.3.1
The thesis of Reichenbach [Rei16] contains a very detailed proof of Theorem 3.4.1, the
description of the stabilizer of detd. It can be adapted to verify Theorem 8.3.1, and
some parts of the proof require only slight modification or none at all. We therefore
refer to Reichenbach [Rei16] frequently and also make use of partial results therein.
We will denote by M:=Cd×dthe space of square matrices and recall that WMis
the subspace of traceless matrices. We will denote by Eij Mthe matrix with entry 1
in position (i,j)and zeros elsewhere. We begin with the important observation that
we only have to show GPGdetd:
8.3.4 Lemma. Let gGP. If there are S,TGLdsuch that g(A) = S·A·Tfor all
AW, then there is a d-th root of unity ζCsuch that T=ζ·S1.
Proof. Set I:=TS. Since gGL(W), we have SAT Wfor any AW. Hence,
AW: 0 =tr(SAT) = tr(ATS) = tr(AI).
Let Eij be the d×dmatrix with entry 1 in position (i,j)and zeros elsewhere. For
distinct indices i=j, we have 0 =tr(EijI) = Iij and therefore Iis a diagonal matrix.
97
Furthermore, from 0 =tr((Eii Ejj)I) = Iii Ijj it follows that all diagonal entries
of Iare equal to some ζ:=I11. Finally, pick some AWwith det(A)=0. From
det(A) = det(g(A)) = det(SAT) = det(S)·det(A)·det(T) = det(TS)·det(A),
it follows that 1 =det(TS) = det(I) = ζd.
Recall that a singular subspace of Mis a linear space LMsuch that det(A) = 0
for all AL. We denote by Lthe set of all maximal left ideals of Mas a ring. By
[Rei16, Satz 2.24], there is a bijection Pd1
=Lgiven by v L(v), where
L(v):={AM|vker(A)}.
Here, vdenotes a line in Cd. In particular, every element of Lis a singular subspace
of M. For L L, we set Lt:={AtAL}, the image of Lunder transposition. The
set Lt={LtL L}is the set of all maximal right ideals of M, see [Rei16, §2.3] for
the complete classification. Every left ideal is the intersection of maximal left ideals,
and analogously for right ideals.
By [Rei16, Thm. 2.38], the set I:=L∪Ltis the set of all singular subspaces of M
of maximal dimension. We will write LW:={LW|L L}and IW:=LWLt
W.
An important preliminary observation is the following proposition, which allows
us to argue essentially as in [Rei16]:
8.3.5 Proposition. The map I IW,L LWis a bijection. Furthermore, IWis
the set of all singular subspaces of Wof maximal dimension.
We require a sequence of lemmata which will be useful throughout the proof.
8.3.6 Lemma. Let LMbe a linear space and assume that SMsatisfies S·LW.
If Eij L, then Sji =0.
Proof. Let I:={(i,j)Eij L}and C[M] = C[xij 1i,jd]. We consider the
matrix A:=(i,j)IxijEij C[M]d×d. Then, 0 =tr(SA) = d
i,j=1Sji Aij =(i,j)ISjixij
as a polynomial in C[M]. Hence, we have Sji =0 for all (i,j)Iby comparing
coefficients.
8.3.7 Lemma. Let L1,L2 L. Then, Wdoes not contain L1Lt
2.
Proof. Assume for contradiction that I:=L1Lt
2W. By [Rei16, Korollar 2.28], there
are S,TGLdsuch that L1·T=L2·Stare both equal to the space of all matrices
with vanishing first column. Since L1is a left ideal and Lt
2a right ideal,
L:=S·I·T=S·(L1Lt
2)·T= (S·L1·T)(S·Lt
2·T)
= (L1·T)(S·Lt
2) = (L1·T)(L2·St)t
98
is equal to the space of matrices where the first row and the first column vanish. In
particular, Eij Lfor all i>1 and j>1. Since conjugation leaves the trace invariant,
(ST)1·L=T1IT W. By Lemma 8.3.6, this implies that (ST)1has zero entries
everywhere except for the first row and column. Hence, rk((ST)1)2<3d, a
contradiction because (ST)1GLd.
8.3.8 Lemma. Wdoes not contain any nonzero left or right ideal of M.
Proof. Since Wis invariant under transposition, it suffices to show that Wdoes not
contain any left ideal of M. For contradiction, assume that L L satisfies LWand
let rbe the maximum rank among elements of L, then r>0 because Lis nonzero. By
[Rei16, Satz 2.24], we have dim(L) = dr and by [FLR85, Theorem 2], a singular space
with this property admits matrices S,TGLdsuch that L:=S·L·Tis the space
of all matrices whose first drcolumns vanish. Because r>0, we have Eid Lfor
all i. Since (ST)1·L=T1LT W, Lemma 8.3.6 implies that the last row of (ST)1
vanishes, a contradiction.
Proof of Proposition 8.3.5.By definition, the map I IW,L LWis surjective. To
see that it is injective, assume for contradiction that there are I1,I2 I with I1=I2
and
J:=I1W=I2W.
By Lemma 8.3.8, we have dim(J) = d2d1. Furthermore, [Rei16, Lemma 2.29]
implies dim(I1I2)d22d+1. Lemmata 8.3.7 and 8.3.8 yield that I1I2is not
contained in W, providing the last inequality in
d2d1=dim(J) = dim(I1I2W)d22d.
This means d1, which is the contradiction we sought.
Let KWbe a singular subspace of maximal dimension. To finish the proof, we
have to show that K IW.
Claim. There is some L L with KL.
Proof of Claim. Note that dim(K)d2d1 because Kis of maximal dimension and
for example, the singular spaces in IWhave dimension d2d1. There are two cases
to consider:
Case 1 (d>3). In this case, dim(K)>d22d+2. By [FLR85, Theorem 3], any
singular space KMwith this property is contained in a left or right ideal of M.
Case 2 (d=3). In this case, we have dim(K)931=5. Assume for contradic-
tion that Kis not contained in any maximal left or right ideal of M. As Kis singular,
99
it is contained in some maximal singular subspace of M, which we recall from Sec-
tion 8.2. Since dim(K)>3 and Kis not contained in any left or right ideal, Kis
contained in a space that is equivalent to
L:={(0 0
0 0
)}.
Since dim(L) = 5dim(K), in fact Kmust be equivalent to L. By definition, there
are matrices S,TGLdwith L=S·K·T. It follows that (ST)1·L=T1KT W
and Lemma 8.3.6 implies that the last row and column of the invertible matrix (ST)1
vanish a contradiction.
It follows from the claim that KLW, the latter is a singular subspace of W.
Therefore, K=LWbecause Kis maximal, hence K IW.
With this at hand, we can mimic the six steps of the proof [Rei16, Thm. 4.2]. Recall
that we denote by tEnd(W)the transposition operator t(A):=At. The first step is
the following:
8.3.9 Proposition. Let gGP. After possibly replacing gby gt, the following holds:
For any K LW, we have g(K) LW.
Proof. We first note that for any K IW, we have g(K) IWbecause gmaps the
singular subspaces of Wto singular subspaces, and preserves the dimension.
Let K1,K2 LWand L1,L2 L with LiW=Ki. After possibly composing g
with t, we may assume that g(K1) LW. Furthermore,
dim(g(K1)g(K2)) = dim(g(K1K2)) (gis injective)
=dim(K1K2) (gis injective)
=dim(L1L2W) (by definition)
=dim(L1L2)1(Lemma 8.3.8)
=d22d1. [Rei16, Lemma 2.29]
If we had g(K2) Lt
W, then Lemma 8.3.7 and [Rei16, Lemma 2.29] would imply that
dim(g(K1)g(K2)) = d22d. Hence, we have g(K2) LWas well. Since K2was
arbitrary, the statement follows.
For the rest of the proof, fix some gGP. We now construct two bijective maps
φ,ψ:Pd1Pd1as follows: Note that gmaps any L(v)Wto some L(w)Wby
Proposition 8.3.9. Since LW
=L
=Pd1, we can define φand ψvia
g(L(v)W) = L(φ(v)) W,g(L(v)tW) = L(ψ(v))tW.
100
As gis a bijection, so are φand ψ. Moreover, φand ψare induced by semilinear
maps T:CdCdand S:CdCd, respectively. This follows from the upcoming
Lemma 8.3.10 and the fundamental theorem of projective geometry [Rei16, Theo-
rem 3.13].
8.3.10 Lemma. Both φand ψmap projective lines to projective lines.
Proof. Assume that u,v,wPd1are three pairwise different lines contained in
one plane, i.e., these projective points lie on the same projective line. Let L:=
L(u)L(v)L(w). Then, dim(L) = d22dby [Rei16, Satz 2.24, Korollar 2.27].
By Lemma 8.3.8, we have dim(LW) = d2d1. Thus,
d22d1=dim(g(LW)) = dim(a(L(v)W)g(L(u)W)g(L(w)W))
=dim(L(φ(u)) L(φ(v)) L(φ(w)) W).
Again by Lemma 8.3.8, we get that
d22d=dim(L(φ(u)) L(φ(v)) L(φ(w)))
and [Rei16, Satz 2.24] implies that the lines φ(u),φ(v)and φ(w)span a plane. The
argument for ψis completely analogous.
We have shown that g(L(v)W) = L(Tv)Wand g(L(v)tW) = L(Sv)tWfor
all v. The third step is to show that Sand Tare linear maps, so we have S,TGLd.
The proof is word for word the same as in [Rei16, Thm 4.2,Proof Step 3], except that
the matrix E11 has to be replaced with E13, which is traceless. This is possible because
we assumed d3.
Let E:=E11 +···+Edd be the identity matrix, so M=W(C·E). We extend g
to a map ¯
gGL(M)by declaring ¯
g(E):=Eand ¯
g|W:=g. We then define the map
u:M M
A St·¯
g(A)·T
Note that uGL(M)as well. For AL(v)W, we know g(A)·Tv =0, so in
particular we have u(A)v=St·g(A)·Tv =0, so u(L(v)W)L(v). Equivalently,
one has u(L(v)tW)L(v)t. We define
Dk:= (E11 +Ek1)(E1k+Ekk)
for k>1, then{D2, . . . , Dd}{Eij |i=j}is a basis of W. Certainly, CEij for i=jcan
be expressed as an intersection of maximal left and right ideals of M. We observe that
for k>2, we can also write
CDk=
L(e1+ek)
i/{1,k}
L(ei)
L(e1ek)t
i/{1,k}
L(ei)t
.
101
It follows that u(CDk) = CDkand u(CEij) = CEij for i=jand k>2. Hence, there
are certain µk,µij C×with u(Dk) = µkDkand u(Eij) = µijEij. We will show that all
these coefficients are identical:
8.3.11 Lemma. There is a µC×such that µ=µkand µ=µij for all i=jand k>2.
Proof. For three distinct indices i,j,k(note that we require d3 here), the line
spanned by Eij +Ekj is contained in Wand can be expressed as the intersection of
maximal left and right ideals. Hence, there is some µijk C×with
µijEij +µkjEkj =u(Eij +Ekj) = µijk ·(Eij +Ekj) = µijkEij +µijkEjk,
showing that µij =µijk =µkj. Similarly, one can show that µij =µik. It follows that
there is a µC×with µ=µij for all i=j. We are left to show that µk=µholds
for all k>2. For this purpose, let j/{1, k}be a third index and observe that the line
spanned by Dk+Ej1Ejk is contained in Wand can be expressed as the intersection
of maximal left and right ideals. Indeed, it is equal to
L(e1+ek)
i/{1,k}
L(ei)
L(e1ek)tL(e1ej)
i/{1,j,k}
L(ei)t
.
From this, we get certain νjk C×with
µkDk+µEj1µEjk =u(Dk+Ej1Ejk) = νjk ·(Dk+Ej1Ejk)
=νjkDk+νjkEj1νjkEjk
and therefore, µk=νjk =µ.
We conclude that u=µ·idM. We can replace Sby µ1Sand achieve that uis the
identity. Consequently, we are done by Lemma 8.3.4.
8.4 The Boundary of the 4×4Determinant
The next logical step would be to study det4and det5to accumulate a more solid
foundation of examples. The maximal linear subspaces of Z(det4)up to action of the
stabilizer are known [EH88, Cor. 1.3], [FLR85], they are as follows:
C0={(0
0
0
0 )}and
C1={(
0 0
0 0
0 0 )}
102
are compression spaces. The semistable spaces are the following:
L1={( 0αβ
α0γ
βγ0
0 0 0 )
α,β,γC},
L2={( γ δ 0 0
0 0 γ δ
α0β0
0α0β)
α,β,γ,δC},
L3={(βδ0 0
α0γδ
δ0β0
γ α 0β)
α,β,γ,δC}.
By the computation given in Program 8.3, one can obtain the following result:
8.4.1 Proposition. For A,BWwe letA,B:=tr(AtB). With respect to this bilinear
form, we choose orthogonal complements W=LiKifor i{1,2}.
There are surjective linear maps aiEnd(W,Li)and biEnd(W,Ki)such that
ai+bi·tis an approximation path from det4to a polynomial Qi(det4)and
(Qi)is a component of (det4)for i{1, 2}.
Remark. The polynomial Q1is equal to
Q1=tr
(0x1x2z3
x10x3z2
x2x30z1
0 0 0 t)
·(c e f 0
e b d 0
f d a 0
y3y2y10)
=t·(a·x2
1+b·x2
2+c·x2
3+d·x1x2+e·x2x3+f·x1x3)
(y1x1+y2x2+y3x3)(z1x1+z2x2+z3x3),
as a polynomial in C[t,a,b,c,d,e,f,x1,x2,x3,y1,y2,y3,z1,z2,z3]. Note that the first
polynomial from Theorem 8.0.1 appears with the factor tas the first summand here.
The polynomial Q2is equal to
Q2=tr
(x1x20 0
0 0 x1x2
x30x40
0x30x4)
·(y1y2b1b2
b3b4y1y2
y3a1y4a2
a3y3a4y4)
=x2
1(x3a2x4a1) + x2
2(x3a4x4a3) + x2
3(b1x2b2x1) + x2
4(b4x1b3x2)
(y1x2x3x4x1y2x3x4+x1x2y3x4x1x2x3y4),
as a polynomial in C[x1,x2,x3,x4,y1,y2,y3,y4,a1,a2,a3,a4,b1,b2,b3,b4].
Remark. As mentioned in [Lan15], these components were first observed by J. Brown,
N. Bushek, L. Oeding, D. Torrance and Y. Qi.
Remark. We used several approximation paths a+bt for det4with aEnd(W,L3).
Every such path gave a degeneration Qof det4whose orbit closure Qhad codimen-
sion one in (det4), i.e., Qwas not a component of the boundary.
103
1
2from sympy import *
3a,b,c,d,e,f,p,q,r,s,u,v,w,x,y,z = X = symbols(
4 .join(c for cin ’abcdefpqrsuvwxyz’))
5
6def OrbitDimension(P,X):
7P = Poly(P,X)
8Df = [ Poly(x)*diff(P,y) for xin Xfor yin X ]
9Mn = list(set(sum((Poly(Q).monoms() for Qin Df),[])))
10 Mx = Matrix([[ Q.coeff_monomial(m) for Qin Df ]
11 for min Mn ])
12 return Mx.rank()
13
14 L1 = Matrix([
15 [ 0, x,-y, p ],
16 [-x, 0, z, q ],
17 [ y,-z, 0, r ],
18 [0,0,0,s]])
19 K1 = Matrix([
20 [ c, e, f, 0 ],
21 [ e, b, d, 0 ],
22 [ f, d, a, 0 ],
23 [u,v,w,0]])
24 Q1 = (L1.adjugate()*K1).trace()
25
26 L2 = Matrix([
27 [ p, q, 0, 0 ],
28 [ 0, 0, p, q ],
29 [ r, 0, s, 0 ],
30 [0,r,0,s]])
31 K2 = Matrix([
32 [ a, b, c, d ],
33 [ e, f,-a,-b ],
34 [ u, v, w, x ],
35 [ y,-u, z,-w ] ])
36 Q2 = (L2.adjugate()*K2).trace()
37
38 for Qin (Q1, Q2):
39 if OrbitDimension(Q,X)==(16*16)-(2*4*4-2)-1:
40 print("Found Component:\n%s" %str(Q.expand()))
Program 8.3: Computing degenerations of det4with Python [Pyt;Sym].
104
We note an interesting fact about the geometry of the spaces Ei:=GEnd(W,Li)
where G:=Gdet4, which might be related. From the proof of [EH88, Thm. 1.2], one
can deduce that (E2)ss is the disjoint union of Ess
2and Ess
3. In other words, Ess
3is the
complement of Ess
2in its closure, its “boundary” so to speak.
This leaves the following question:
8.4.2 Question. Does (det4)contain any irreducible component other than the or-
bit closures of Q1,Q2and the traceless determinant?
It is also noteworthy that L1has the space of skew-symmetric 3 ×3 matrices as its
“primitive part”, see [EH88] for a definition. We also ask if the construction of this
component generalizes:
8.4.3 Question. Let d=2kN. Let W:=Cd×dand consider the linear subspace
L:=
A
.
.
.
0··· 0
AC(d1)×(d1),
A=At
Clearly, Lis a singular subspace of detd.
We ask whether there exists a vector space complement W=LUand linear
maps a:WLand b:WUsuch that a+bt is an approximation path from detd
to a polynomial Qwhose orbit closure is an irreducible component of (detd).
We end with the sobering remark that the 5 ×5 case is already quite substantially
more difficult. The annihilator Adet5contains infinitely many orbits under the action
of Gdetd, as shown in [Bor+16, Theorem 5]. This means that for general d, the Gdetd-
orbit structure of Adetdis presumably quite involved. Understanding this structure is
more or less equivalent to the classification of all maximal linear subspaces of detd.
105
Chapter 9
The Binomial
It would be desirable to have a description of (Pd)for a complete family of poly-
nomials P= (Pd)dN. A trivial example is (mnd)dNbecause it is closed, as noted in
Lemma 4.3.1. However, this is of little educational value. Our suggestion is to study
the polynomial family
bnd:=x1···xd+y1···yd, (1)
the generic binomial. We assume d3 throughout.
9.0.1 Remark. Note that bnd(detd). For example,
bn5=det
x1y10 0 0
0x2y20 0
0 0 x3y30
0 0 0 x4y4
y50 0 0 x5
and this construction generalizes easily.
Furthermore, bndis also a special case of another important polynomial family,
namely the sums of products polynomial k
i=1d
j=1xkj, see [CKW10, Chapter 11] and
[Kay12].
9.0.2. Here is a brief summary of our analysis of the binomial:
(1) Let S End(W)be the set of noninvertible endomorphisms. The set B:=bndS
is an irreducible component of (bnd). It is not an orbit closure.
(2) There is only one semistable linear subspace of Z(bnd). Using it, we construct
a linear degeneration Qof bndsuch that Qis an irreducible component of the
boundary.
(3) We prove that the scheme ˆ
Abndis generically smooth, in particular generically
reduced. If it is reduced and its components intersect transversally, then (bnd)
has exactly these two irreducible components.
We let W:=Cd×Cdwith coordinates xion the left factor and yjon the right one,
so we have bndV:=C[W]d. It is easy to see that bndis concise.
107
9.1 Stabilizer and Maximal Linear Subspaces
We will denote by Hd:=Gbndthe stabilizer group of the binomial. The main goal of
this section is to prove the following result:
9.1.1 Theorem. Let W:=Cd×Cdwith d3. The stabilizer group HdGL(W)is
reductive of dimension 2d2. Moreover,
(1) The identity component of the stabilizer of Pis the group
H
d={diag(s1, . . . , sd,t1, . . . , td)d
i=1si=d
i=1ti=1}.
(2) Let KGL(W)be the group generated by
the map t:WW,(v,w) (w,v), and
the group Sd×SdGLd×GLdGL(W)of permutation matrices that per-
mute the first and last dcoordinates among themselves, respectively.
Then, K
=SdZ2and Hd=K·H
dhas|K|=2·d!·d! connected components.
9.1.2 Corollary. By Theorem A.1.9, dim((bnd)) = 4d22d+2.
One can prove Theorem 9.1.1 by means of the explicit description of Lie(Hd)from
Corollary 7.2.3, but another method is to study the action of Hdon linear subspaces
of the vanishing locus Z(bnd)W=Cd×Cd. It is straightforward to provide linear
subspaces of Z(bnd): The most canonical choice are the spaces Lij :=Z(xi,yj)for
1i,jd. Furthermore, there is the space
L0:={(w1, . . . , wd,ζw1, . . . , ζwd)|w1, . . . , wdC}W,
where ζCwill always denote a fixed d-th root of 1, i.e., we have ζd=1. For the
proof of Theorem 9.1.1, we will prove the following proposition as an auxiliary result.
9.1.3 Proposition. The maximal linear subspaces of Z(bnd)consist of the Lij and the
spaces h(L0)for hHd. Furthermore, the Lij are unstable.
Proof of Theorem 9.1.1.It is easy to see that the elements of H
dand Kstabilize bnd.
We are left to show that HdK·H
d. The elements of Hdact on the set of linear
subspaces of Z(bnd)of any fixed dimension, so by Proposition 9.1.3 they permute
the Lij. Dually, the action of Hdon Wpermutes the spaces L
ij :=xi,yj Wfor all
1i,jdin the sense that for all hHd, there are 1 r,sdsuch that
L
ij h:=xih,yjh=xr,ys=L
rs.
108
Let hHdand 1 kd. We claim that there is an index 1 idsuch that the
linear form xkhis either a scalar multiple of xior yi. To this end, let 1 i,j,r,sd
be such that L
k1h=L
ij and L
k2h=L
rs, then
xkh=xkh= (L
k1L
k2)h=L
ij L
rs.
Hence, the two planes L
ij and L
rs intersect in the line spanned by (xkh), in particular
this intersection is nontrivial. It follows that i=ror j=sbecause for i=rand j=s,
we have L
ij L
rs ={0}. Therefore, either xkh=xior xkh=yjas claimed.
In the same manner, we can show that ykhis some scalar multiple of a coordinate
function. This means that his the product of a permutation and a diagonal matrix.
The result is a straightforward corollary.
We are left to show Proposition 9.1.3. The proof has been split up into the following
three lemmata:
9.1.4 Lemma. A maximal linear subspace LZ(bnd)which is contained in a coordi-
nate hyperplane is equal to Lij for some choice of iand j.
Proof. Assume LZ(xi). It follows that any (v1, . . . , vd,w1, . . . , wd)Lmust satisfy
w1···wd=0. Consequently, LLi1···Lid. It follows that L=Lij for some j.
9.1.5 Lemma. The spaces Lij are unstable.
Proof. It is sufficient to show that L1dis unstable since Lij =h(L1d)for an appropriate
permutation hHd. To see that L1dis unstable, let a:WL1dbe any linear map
and consider
hε:=diag(ε1d,ε, . . . , ε,ε1d)Hd,
where ε1dis in the first and last position. Note that the representing matrix of ahas
vanishing first and last row. Therefore, hεa0 as ε0.
9.1.6 Lemma. Let LZ(bnd)be a maximal linear subspace which is not contained
in any coordinate hyperplane. Then, there is an hHdsuch that h(L) = L0.
Proof. Since Lis not contained in any of the coordinate hyperplanes, there is a point
(u1, . . . , ud,v1, . . . , vd)Lsuch that ui=0 and vi=0 for all 1 id. Let uCbe
such that ud=u1···ud. We apply the scaling matrix
diag(u
u1, . . . , u
ud,v2···vd
ζd1ud1,ζu
v2, . . . , ζu
vd)H
d
to the space Land thereby achieve that (u, . . . , u,ζu, . . . , ζu)L. By scaling, we
get that p:= (1, . . . , 1, ζ, . . . , ζ)L. We denote by Kthe discrete part of Hd, i.e.,
109
the finite group of permutations that stabilize bnd. We now claim that LK.L0,
so Lis contained in a finite union of translates of L0. This will imply the state-
ment because Lhas to be equal to one of the translates of L0. For the proof, let
q= (u1, . . . , ud,v1, . . . , vd)Lbe any point different from p. Since the entire line
containing pand qmust be a subset of Land subsequently a subset of Z(bnd), this
means that bndvanishes at Tp qfor any TC. In other words,
0=
d
i=1
(Tui) +
d
i=1
(ζTvi) =
d
i=1
(Tui)
d
i=1
(Tζ1vi)
as an identity of polynomials in C[T]. Hence, ui=ζ1viup to a permutation which
stabilizes bnd.
For future reference we consider the matrix
b:=(I0
ζI0), (2)
then End(W,L0) = bGL(W)is the set of all endomorphisms that map into L0. From
Proposition 9.1.3, we obtain:
Ass
bnd={ha|aEnd(W,L0)ss}=HdEnd(W,L0)ss =HdbGL(W)ss. (3)
9.2 The First Boundary Component
There are certain natural degenerations of bndwhich we will study here. We introduce
the notation PQ:PQand say that Pis linearly equivalent to Qin this case.
9.2.1 Remark. An important property of linear equivalence is the fact that PQ
implies GP
=GQ, indeed if Q=Pgthen it is easy to see that GQ=g1GPg.
For the rest of this section, we also set
S:={aEnd(W)|rk(a)<2d},
so Sis the hypersurface of noninvertible endomorphisms on W=C2d. Our main
result of this section is the following:
9.2.2 Proposition. Let B(bnd)be the closure of bndS. Then, Bhas codimension
one in (bnd)and it is an irreducible component of (bnd).
For the proof, we define the polynomials
Qr,s=x1···xd+y1···yd1·(r
k=1
xk+
s
k=1
yk)
for rdand s<d. Note that Qr,sbndS. We defer the proof of the following
auxiliary result:
110
9.2.3 Lemma. The polynomials Qd1,d1and Qd,d1are elements of bndS whose
orbits are disjoint. Furthermore, both orbits are of codimension 2 in (bnd).
Proof of Proposition 9.2.2.By definition, Bis an irreducible closed subset of (bnd),
because it is the closure of the image of the irreducible variety Sunder the orbit map.
Let us check that it is contained in (bnd): The elements of bndS are not concise.
Since that is a closed condition by Remark 5.1.3, no element of Bis concise. As all
elements of (bnd)are concise, we know that Bmust lie in the complement.
We are left to determine the dimension of B. By Lemma 9.2.3,Bhas codimension at
most 2. Assume for contradiction that the codimension of Bin (bnd)is equal to 2.
The orbit closure (Qd,d1)is an irreducible closed subset of Band it has the same
dimension as Bby Lemma 9.2.3 and our assumption. This implies that (Qd,d1) = B.
We have Qd1,d1B=(Qd,d1), but Qd1,d1/(Qd,d1)by Lemma 9.2.3. This
means Qd1,d1(Qd,d1). However, the orbit of Qd1,d1has the same dimension
as the orbit of Qd,d1, which is a contradiction to Theorem A.1.9.(3).
It follows that the codimension of Bis at most 1, and since it is completely con-
tained in the boundary, this implies that it is a component.
Proof of Lemma 9.2.3.Let r{d1, d}and Q:=Qr,d1. We define the linear form
:=x1+···+xr+y1+···+yd1, so Q=x1···xd+y1···yd1·. Furthermore, we
denote by a S the endomorphism that satisfies yda=and ais the identity on all
other coordinates, so that Q=bnda. We will compute the dimension of GQand the
number of its connected components. We will see that the former does not depend
on r, whereas the latter does this will imply the lemma by Remark 9.2.1. To obtain
a description of GQ, we study its action on the set of linear subspaces of Z(Q).
Claim. If LZ(Q)is a linear subspace with dim(L) = 2d2, then there are indices
1i,jdsuch that L=Z(xi,yja).
For j<d, this means L=Lij =Z(xi,yj)and otherwise L=Z(xi,).
Proof of Claim. Since Q=bnda, we know that a(L)is a linear subspace of Z(bnd), so
the space a(L)is contained in one of the spaces from Proposition 9.1.3. We first show
that there are 1 i,jdwith a(L)Lij.
The form ydvanishes on im(a), but it is straightforward to verify that it does
not vanish on h(L0)for any hHd. Therefore, a(L)=L0, but it might be properly
contained in it. We show that this is not possible: Since d3, we have
dim(a(L)) dim(L)1=2d3d=dim(L0).
Hence, we have a(L)Lij. In particular, the linear forms xi=xiaand yja
both vanish on L. The latter is equal to yjfor j<dand otherwise it is equal to . In
either case however, the two forms are linearly independent and therefore cut out the
space L.
111
The action of GQpermutes these linear spaces. Dually, its action on Wper-
mutes the spaces xi,yja W. Reminiscent of the proof of Theorem 9.1.1, we
can conclude in the same fashion that the action of GQmaps every element of the set
M:={x1, . . . , xd,y1, . . . , yd1,}to a scalar multiple of an element of M. Of course, it
can map ydto any linear form which is linearly independent from the other coordi-
nates. Hence, there is a group KGL2d1such that GQadmits a matrix representa-
tion
GQ=
0
h.
.
.
0
··· α
hK,
αC×
=K×C2d1×C×. (4)
Claim. K is a finite group of order|K|=r!·d2·(d1)!.
This claim implies the lemma as follows: We have dim(GQ) = 2d=dim(Hd) + 2,
the latter by Theorem 9.1.1. From Theorem A.1.9, we conclude that Qhas codimen-
sion two in (bnd). If Qd1,d1and Qd,d1were in the same orbit, then their stabi-
lizers would be conjugate and in particular, they would both have the same number
of connected components. However, the number of connected components of GQis
equal to|K|, which depends on r.
We are therefore left to verify our claim. We first study the subgroup
Π:={πKπC×}
of Kand observe that its order depends on r:
Claim. We have Π
=Zdo(Sr×Sd1), where Zdcorresponds to scaling all variables
with a d-th root of unity simultaneously and Sr×Sd1permutes the x-variables and
the y-variables that occur in among themselves.
Proof of Claim. We first check that every πΠis a product of a diagonal and a
permutation matrix. Let M:={Cx1, . . . , Cxd,Cy1, . . . , Cyd1,C}be the set of lines
spanned by elements of M. We know that πinduces a permtutation of Mwhich
has Cas a fixpoint. In particular, πinduces a permutation of the lines spanned by
the first 2d1 coordinates. This is precisely what we claimed.
Next, we show that all diagonal matrices in Πare scalar and correspond to mul-
tiplication with a d-th root of unity. Let πΠbe a diagonal matrix and let ξC×
be such that π=ξ. It is straightforward to see that this implies xiπ=ξxiand
yjπ=ξyjfor all 1 i,jd1. As πstabilizes
Q=x1···xd+y1···yd1·(x1+···+xr+y1+···+yd1),
112
it must, for instance, stabilize the monomial y2
1·y2···yd, whence ξd=1. Let ϑC×
be such that xdπ=ϑxd. As πalso stabilizes the monomial x1···xd, we also get
ϑξd1=1, hence ϑ=ξ.
We now assume that πΠis a permutation and show that πdoes not map
any x-variable to a y-variable. Observe that any monomial of Qthat is divisible by
ay-variable is divisible by d1 of the y-variables. Assume for contradiction that
some x-variable is mapped to a y-variable. Then, since (x1···xd)π=d
i=1(xiπ)
is a monomial of Q, we conclude that d1 of the d x-variables must be mapped to
y-variables. In turn, all y-variables must be mapped to x-variables. As Qcontains the
monomial y2
1·y2···yd1, this implies the contradiction that Q=Qπalso contains
the monomial x2
1·x2···xd1.
We are left to verify that for r=d1, every permutation πΠleaves xdinvariant.
Assume for contradiction that there is some id1 such that xiπ=xd. Then, Q=
Qπcontains the monomial (y1···yd1·xi)π=y1···yd1·xd, a contradiction.
As|Π|=r!·d·(d1)!, we are left to verify that K/Πcontains dresidue classes,
by Lagrange’s Theorem. For any 1 j<d, denote by gjKthe map that satis-
fies yjgj=and is the identity on all other variables. Hence, gj=yjand it
is easy to check that gjstabilizes Q. We will show that
K/Π={Π,g1Π, . . . , gd1Π}. (5)
To this end, let gKbe such that gis not a scalar multiple of . Then, there is
some βC×with g=β·z, where zis a variable. Furthermore, there is an ηC×
such that gmaps a variable ˜
zMto η·. By applying an element of the subgroup Π,
we may assume that z=˜
z, i.e., we have zg=η·. We may furthermore assume
that gmaps all variables other than zto scalar multiples of themselves. By studying
the monomials of Q, it is straightforward to check that z=yjfor some 1 j<d.
There are certain scalars αiand βksuch that
β·yj=g=(r
i=1xi+d1
k=1yk)g=(r
i=1αixi)+(k=jβkyk)+η·.
With βj:=β, we obtain that η·=r
i=1αixi+d1
k=1βkyk, so αi=ηand βk=η
for all iand k. In particular, η=β. It is now again easy to check that ηis a d-th root
of unity, so we have achieved ggjKand proved (5).
It is natural to ask what a generic elements of B:=bndS looks like. It turns out
that Bis not an orbit closure. Instead, for the one-parameter family of polynomials
ˆ
Qt:=t·x1···xd+y1···yd1·(d
k=1
xk+
d1
k=1
yk),(tC×)
we prove the following result.
113
9.2.4 Proposition. Let B:=bndS. The union U:=tC×(ˆ
Qt)is dense in B
and contains an open subset of B. In particular, a generic element of Bis linearly
equivalent to ˆ
Qtfor some tC×.
For the proof, we require the following technical lemma whose proof we postpone
until after the proof of Proposition 9.2.4.
9.2.5 Lemma. Let a S with dim(ker(a)) = 1 and P:=bnda. Then, one of the
following is true:
(1) There is some tC×such that Pˆ
Qt.
(2) There are natural numbers rdand sd1 such that PQr,s.
Furthermore, the codimension of Pin Bis at least 1, so it is a proper closed subset
of B.
Proof of Proposition 9.2.4.The map ω:S B,a bndais a dominant morphism.
The set S1:={a S |dim(ker(a)) = 1}is open and dense in S. Since ωis dominant,
this means that ω(S1)is dense and contains a nonempty open subset, the latter by
[TY05, 15.4.3]. By Lemma 9.2.5, we have ω(S1)\Z=U, where
Z:=d
r=1d1
s=1(Qr,s).
By Lemma 9.2.5,Zis a finite union of proper closed subsets of B, therefore Zitself is
a proper closed subset of B. Consequently, Uis dense and contains a nonempty open
subset.
9.2.6 Remark. Of course, it is equally true that the set
tC×
(ˆ
Qt)
1rd
1s<d
(Qr,s)
is dense in Band contains an open subset, the statement of Proposition 9.2.4 merely
emphasizes the fact that the Qr,sare just a finite number of “special” cases.
Proof of Lemma 9.2.5.By Lemma 8.3.3, we have dim(GP)2dand by Theorem 9.1.1,
this means dim(GP) = dim(Hd) + 2. This implies that Phas codimension at least 2
in (bnd)by Theorem A.1.9. Proposition 9.2.2 then yields that Phas codimension
at least 1 in B.
We will show later that there is a binary vector ε{0,1}2d1and some tC×
such that P=bndais linearly equivalent to the polynomial
Pε,t:=t·x1···xd+y1···yd1·(d
k=1
εkxk+
d1
k=1
εd+kyk)(6)
114
We check first that Pε,tis linearly equivalent to some ˆ
Qtor to some Qr,s. Let rbe the
number of indices 1 kdwith εk=1 and let sthe number of indices 1 k<d
with εd+k=1. By permutation of the variables we get
PPε,tt·x1···xd+y1···yd1·(r
k=1
xk+
s
k=1
yk).
If r=dand s=d1, the above means Pˆ
Qt. If we have r<d, then we can scale
xdby t1to achieve PQr,s. The final case is s<d1. In this case, choose ϑCbe
so that ϑd=t. Scaling yd1by ϑ1dand all other variables by ϑalso yields PQr,s.
We will compute the codimension of (Pε,t)last and first show that PPε,t. We
assume d=3, making the proof easier to read and follow. It generalizes easily to
general d. We will show that there are hHdand gGL(W)such that
hag=
t0 0 0 0 0
0 1 0 0 0 0
0 0 1 0 0 0
0 0 0 1 0 0
0 0 0 0 1 0
ε1ε2ε3ε4ε50
(7)
for a certain binary vector εand tC×. Then,
Pε,t=bnd(hag) = bnd(ag) = Pg.
To verify (7), we will apply elements of Hdfrom the left and perform arbitrary column
operations to transform ainto the right hand side of (7). By column operations, we
can achieve
a=
0 0 0 0 0
00 0 0 0
0 0 0 0 0
0000 0
0 0 0 0 0
∗∗∗∗∗0
with all diagonal entries nonzero. Composing from the right with another element
of GL(W), we may scale all columns to achieve that
a=
0 0 0 0 0
00 0 0 0
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
ε1ε2ε3ε4ε50
for some binary vector ε. After precomposing awith an element of Hd, we may
assume that there are ˜
η,ηC×with
a=
˜
η0 0 0 0 0
0˜
η0 0 0 0
0 0 ˜
η0 0 0
0 0 0 η0 0
0 0 0 0 η0
ε1η ε2η ε3η ε4η ε5η0
.
115
We now scale every column with η1and set ϑ:=˜
ηη1, obtaining the form
a=
ϑ0 0 0 0 0
0ϑ0 0 0 0
0 0 ϑ0 0 0
0 0 0 1 0 0
0 0 0 0 1 0
ε1ε2ε3ε4ε50
Finally, set t:=ϑd. We multiply by diag(ϑd1,ϑ1, . . . , ϑ1, 1, . . . , 1)Hdfrom the
left to achieve
a=
t0 0 0 0 0
0 1 0 0 0 0
0 0 1 0 0 0
0 0 0 1 0 0
0 0 0 0 1 0
ε1ε2ε3ε4ε50
.
9.3 The Second Boundary Component
We will use the linear subspace L0to construct a concise degeneration Qof bnd. We
choose the linear map b:WL0as in (2), i.e., we have xib=xiand yib=ζxifor
all 1 id, where ζd=1. Consider the linear approximation
bnd(b+t·idW) = (t+1)d(x1···xd) +
d
i=1
(ζxi+tyi).
The coefficient of tin this expression is the polynomial
˜
Q:=
d
k=1(yk·ζd1d
=1
=kx)+d·
d
=1
x.
Scaling each xby d
d1and each ykby ζd+1d
dd1, we get the polynomial
Q:=
d
k=1(yk·d
=1
=kx)+
d
=1
x,
which satisfies Q˜
QGL(W)(bnd).
We now prove that Qis concise and compute the dimension of its stabilizer:
9.3.1 Proposition. The polynomial Qis concise and dim(G
Q) = 2d1.
Furthermore, the identity component G
Qof GQconsists of all matrices
(s0
u t )=
s10
...0
0sd
u10t10
......
0ud0td
(8)
such that tj=i=js1
ifor all 1 jdand 1 =d
i=1si+d
j=1uji=jsi.
116
Proof. We will compute the Lie algebra of G
Q. In the process, we will also compute
the partial derivatives of Qand note in passing that Qis concise.
Denote by YGL(W)the closed subvariety of all matrices of the form (8), subject
to the listed conditions. We use coordinates (x1, . . . , xd,y1, . . . , yd)on the space W=
Cd×Cd. In other words, xiis mapped to sixiand yjis mapped to ujxj+tjyj. It is a
straightforward computation to verify that any element of Ystabilizes Q, so YG
Q.
This implies that we have dim(G
Q)dim(Y)3d(d+1) = 2d1 since any of
the given (d+1)equations can reduce the dimension by at most one.
We now show that the dimension of the Lie algebra Lie(GQ)is at most 2d1. This
gives us
2d1dim(GQ) = dim(Lie(GQ)) 2d1,
therefore we have equality. It follows in particular that Yis an irreducible component
of GQcontaining the identity, so Y=G
Q.
Recall the action of gl(W)on V=C[W]dfrom the proof of Theorem 9.1.1. We
again consider the elements of gl(W)as block matrices (s v
u t )where s,u,v,tCd×d.
Such a block matrix is in Lie(GQ)if and only if
0=
d
i=1
d
j=1(sij ·(xiQ
xj)+uij ·(xiQ
yj)+vij ·(yiQ
xj)+tij ·(yiQ
yj)). (9)
We set µj:=x1···xd
xj=Q
yjand similarly µjk :=x1···xd
xjxk. Then, we have
Q
xj=
d
k=1
k=j
yk·
d
=1
=j,=k
x
+
d
=1
=j
x=
d
k=1
k=j
ykµjk +µj.
We note that the partial derivatives of Qare all linearly independent, hence Qis
concise this proves part of the statement. Plugging this into (9) yields
0=
d
i=1
d
j=1(uijxiµj+tijyiµj+(
k=j
sijxiykµjk)+sijxiµj+(
k=j
vijyiykµjk)+vijyiµj)
=
d
i=1
d
j=1((sij +uij)xiµj+ (tij +vij)yiµj+(
k=j
sijxiykµjk)+(
k=j
vijyiykµjk)).
(10)
We have summarized some coefficients of certain monomials appearing in the
right hand side of (10) in Figure 9.3.1. Coefficient 6implies that vis the zero matrix
and Coefficient 5implies that s=diag(s1, . . . , sd)is diagonal. Note that we can pick
any k/{i,j}for Coefficients 5and 6, and we require the condition d3 for such a k
117
No. Monomial Condition Coefficient
(1) xiµji=j sij +uij
(2) xjµjd
i=1sii +d
i=1uii
(3) yiµji=j tij +vij +sij
(4) ykµktkk +vkk +j=ksjj
(5) xiykµjk i=j,k/{i,j}sij
(6) yiykµjk j=k vij
Figure 9.3.1: Coefficients of certain monomials occuring in (10)
to exist. Coefficient 1therefore means that u=diag(u1, . . . , ud)is also diagonal, and
Coefficient 2becomes d
i=1
ui+
d
i=1
si=0. (11)
Finally, Coefficient 3implies that t=diag(t1, . . . , td)is also diagonal and Coefficient 4
means
tk=d
j=1
j=k
sj.
Hence, the only parameters that remain are the diagonal entries of sand u, modulo
the relation (11). This proves dim(Lie(GQ)) 2d1 as required.
9.3.2 Corollary. The orbit closure (Q)is a component of (bnd)which is not con-
tained in bndEnd(W).
Proof. Proposition 9.3.1 implies that (Q)is an irreducible closed subvariety of the
boundary (bnd). It is not contained in bndEnd(W)because Qis concise. Fur-
thermore, this variety has dimension dim((Q)) = 4d22d+1=dim((bnd)) 1,
by Corollary 9.1.2 and Theorem A.1.9.
9.4 The Indeterminacy Locus
We will ommit bndas a subscript in this chapter to ease the notation, i.e., we denote
by Athe annihilator of bndand by ˆ
Athe subscheme of End(W)given by the equations
bnda=0 in the coordinates a.
We have constructed one concise component of the boundary, but we do not know
if there are more, potentially depending on d. Unfortunately, we cannot determine
the complete scheme structure of ˆ
Ass. We know that Ass =HdEnd(W,L0)ss by (3).
We will see that Ass has d! irreducible components and each of these components is
smooth. However, the scheme ˆ
Ass will be singular where these components intersect.
118
9.4.1 Remark. If ˆ
Ass is reduced and its irreducible components intersect transver-
sally, the blowup β:ΓEnd(W)ss with center ˆ
Ass =Ass is a sequence of smooth
blowups, one for each component [Li09, Thm. 1.3]. Moreover, in this case the stabi-
lizer of the binomial acts transitively on the irreducible components of β1(Ass)and
by Remark 7.3.12, it follows that (Q)is the only concise component of (bnd).
We emphasize that this remark is not to be mistaken as a conjecture it is quite
unclear whether or not ˆ
Ass is even reduced at the intersection of its components. That
said, we can still show the following:
9.4.2 Theorem. For any point y Ass that lies outside the intersection of two ir-
reducible components of Ass, the scheme ˆ
Ass is smooth at y. In particular, ˆ
Ass is
generically smooth and in particular generically reduced.
The remainder of this section is dedicated to the proof of Theorem 9.4.2.
9.4.3. Consider b=(I0
ζI0), then Ass = (HdbGL(W))ss. Let TSLdbe the group
of diagonal matrices with determinant one. We define
Y1:=H
dbGL(W) ={(s0
0t)( I0
ζI0)(g1g2
g3g4)(s0
0t)H
d,(g1g2
g3g4)GL(W)}
={(sa
ζa)sTand aCd×2d}.
For a permutation πSd, which we understand as a d×dpermutation matrix, we
furthermore define
Yπ:=(π0
0I)Y1={(πsa
ζa)sTand aCd×2d}.
9.4.4 Proposition. The irreducible components of Y:=HdbGL(W)are the Yπfor
all permutations πSdand these components are pairwise distinct.
In particular, Yhas d! irreducible components. A point (πsa
ζa)Yis semistable if
and only if all rows of aare nonzero.
Proof. Recall the permutation group KGL(W)from Theorem 9.1.1 which permutes
the first and last dcoordinates among themselves and which also swaps the xand y
coordinates simultaneously. Since Hd=K·H
d, we have Y=K·Y1.
We first note that Y1is invariant under the permutation switching the xand y
coordinates simultaneously: For sTand aCd×2d, set t:=ζ2s1Tand c:=
ζ1sa. Then, (ζa
sa )=(tc
ζc).
Furthermore, we will see that permutations on the first dcoordinates suffice: For
ϱ,σSdlet π:=ϱσ1and b:=σato see that
(ϱ0
0σ)(sa
ζa)=(ϱsa
σζa)=(πsb
ζb).
119
Assume now that Yπ=Yσfor two permutations πand σ. We have (π
ζI)Yπ=Yσ
and hence, there are aCd×2dand an sTwith (π
ζI)=(σsa
ζa). This implies that we
have a=Iand hence, σs=π, so s=σ1π. However since sis diagonal, this means
it must be the identity and therefore σ=π. This proves that the Yπare the pairwise
distinct irreducible components of Y.
The semistability condition follows immediately from the description of the maxi-
mal linear subspaces in Section 9.1.
9.4.5 Corollary. A point (πsa
ζa)Yπlies in the intersection of Yπwith another com-
ponent Yσof Yif and only if there are two rows of aCd×2dthat are scalar multiples
of one another.
Proof. If the i-th and j-th row of aare scalar multiples of one another, let σbe the
composition of πwith the transposition τ:= (i,j). By assumption, there is a tT
with tτa=sa, so we have πsa =πτtτa=σtτa=σsa. Hence, aYπYσ.
Conversely, assume that there is a permutation σ=πwith (πsa
ζa)=(σtc
ζc)for
some tTand cCd×2d. It follows immediately that c=a, hence πsa =σta.
Denoting by aithe i-th row of a, this means sπ(i)aπ(i)=tσ(i)aσ(i)for all 1 id. As
π=σ, there is an index iwith k:=π(i)=σ(i) =:j, so we have skak=tjaj, therefore
ak=s1
ktjaj.
The proof of Theorem 9.4.2 is now completed by the following proposition.
9.4.6 Proposition. Let aCd×2dbe such that no row of ais a scalar multiple of
another. Let also πSdand uT. Then, y:=(πua
ζa)is a smooth point of ˆ
Ass.
Proof. We first note that dim(Y) = dim(Y1) = 2d2+d1, hence we have to show that
the tangent space Tyˆ
Ass has the same dimension.
As Hd×GL(W)and Sdact by automorphisms on ˆ
Ass, the tangent space at yis
isomorphic to the tangent space of any element in the Hd×GL(W)orbit of y. Hence,
we may assume y=(a
ζa).
Note that each row of ais nonzero. After applying some column operation
from GL(W)we may assume that ai1=0 and the last dcolumns of aare zero.
Scaling the first column by the inverse of a11 ···ad1with another column operation,
we achieve that a11 ···ad1=1. We then apply a row scaling operation from Hdto
achieve ai1=1 for all 1 id. We set ai:=xia=x1+d
j=2aijxj. Since the aiare
not scalar multiples of one another, we achieve ai=x1+cix2+˜
aiwith further column
operations, where the ciare all distinct and the ˜
aiare linear forms in x3, . . . , xd.
120
Let b=(s v
u t )TyEnd(W) = End(W)be a tangent vector, where s,v,u,tCd×d.
The polynomial bnd(a+bT)is equal to
d
i=1(ai+
d
j=1
sijxjT+
d
j=1
vijyjT)+
d
i=1(ζai+
d
j=1
uijxjT+
d
j=1
tijyjT).
Set fi:=k=iak, then the coefficient of Tin this expression is equal to
d
i=1
d
j=1
sij fixj+vij fiyjζ1d
i=1
d
j=1
uij fixj+tij fiyj
=
d
i=1
d
j=1(sij ζ1uij)fixj+(vij ζ1tij)fiyj. (12)
By the Jacobian criterion, bTyˆ
Ass is equivalent to the vanishing of (12) as a poly-
nomial in the xjand the yj. We first note that the fiC[x1, . . . , xd]d1are linearly
independent polynomials: Indeed,
fi=˜
fi+
k=i
(x1+ckx2+˜
ak) =
k=i
(x1+ckx2)(13)
for certain polynomials ˜
fiC[x1, . . . , xd]d1whose monomials all contain some vari-
able xjwith j>2. Hence, it is sufficient to show that the gi:=k=i(x1+ckx2)are
linearly independent. Assume that 0 =d
i=1λigifor certain λiC. For any k, we can
evaluate this expression at the point (ck,1)and since gkis the only function from
among the githat does not vanish, it follows that λk=0.
Since we have established that the fiare linearly independent and do not use the
y-variables, it follows immediately that the coefficients of fiyjin (12) must all vanish,
i.e., we have v=ζt. This constitutes d2linear conditions.
If we can show that the polynomials fixjspan a subspace of dimension at least
d2d+1 inside C[x1, . . . , xd]d, we are done: It gives us at least 2d2d+1 linear
conditions on the parameters (s v
u t )and hence,
dimTyˆ
Ass 4d2(2d2d+1) = 2d2+d1=dim(Y).
Since the other inequality always holds, this implies equality. Now, recall from (13)
that fi=˜
fi+gi. For 1 kd, let Ukbe the span of all monomials that have combined
degree kin{x1,x2}, so we have C[x1, . . . , xd]d=d
k=0Uk. We have fixjd1
k=0Ukfor
all 2 <jdand the projection to Ud1is equal to gixj. These polynomials span a
space of dimension d22dbecause the giare linearly independent and do not contain
the variables xjwhen 2 <jd.
For j{1,2}, the projection of fixjto Udis equal to gixjand it will suffice to show
that these polynomials span a space of dimension at least d+1. Write these vectors in
121
coordinates with respect to the basis (x0
1xd
2,x1
1xd1
2,x2
1xd2
2, . . . , xd
1x0
2), then they form
the columns of a (d+1)×(2d)matrix
M:=
g1x1··· gdx1g1x2··· gdx2
0··· 0
g
g
0··· 0
with gGLdbecause the giare linearly independent. We have
M·(g10
0g1)=
··· 1
1...
...1
1 ···
which is easily seen to have full rank.
As explained in Remark 9.4.1, it would be interesting to know whether ˆ
Ass is
reduced everywhere:
9.4.7 Question.
(1) Is the scheme ˆ
Ass
bndCohen-Macaulay?
(2) Is the scheme ˆ
Ass
bndreduced?
Remark. Note that (1) implies (2) by Theorem 9.4.2 and the fact that Cohen-Macaulay
schemes have no embedded components [GW10, Prop. 14.124].
122
Part III
Appendix
123
Appendix A
Algebraic Groups and Representation Theory
Throughout this chapter, we fix an algebraically closed field Cof characteristic zero.
We assume knowledge of classical algebraic geometry, see [Har06;Har95] and also
[Kra85, Anhang]. We denote the coordinate ring of an affine C-variety Xby C[X]. We
also use the notation [n]:={1, . . . , n}.
A.1 Algebraic Semigroups and Groups
If Eis a semigroup acting on a set X, we usually denote this action by a dot, so
the action of gEon xXis denoted g.x. The set Ex:={gE|g.x=x}is the
stabilizer of xX. The orbit of xunder the action of Eis the set E.x:={g.x|gE}.
If E.x={x}, we say that xis E-invariant. If SEis any subset, we denote by
XS:={xX|sS:s.x=x}the set of all S-invariant elements of X. A subset
YXis E-stable if E.YY. If Eis a monoid, we write 1or 1Efor the unique neutral
element of E.
A.1.1 Definition. An algebraic semigroup is a semigroup (E,)which is also a C-
variety such that the composition map
µE:E×E E
(g,h) gh
is a morphism of varieties. A morphism of algebraic semigroups is a morphism
φ:EEof varieties between two algebraic semigroups which is also a morphism
of semigroups. An left-E-variety is a variety Xtogether with a morphism
αX:E×X X
which defines a left E-action on the set X. Similarly, the notion of a right-E-variety is
defined. An E-variety is defined to be a left-E-variety. A morphism of E-varieties is
a map φ:XYwhich is a morphism of varieties such that φ(g.x) = g.φ(x)for all
gEand xX.
125
A.1.2 Definition. An algebraic group (monoid) is an algebraic semigroup Ewhich is
also a group (monoid). The unit group of an algebraic monoid Eis defined as the
group G(E):={gE|1(gE)(Eg)}.
A.1.3 Proposition. The unit group of an algebraic monoid is an algebraic group. If
Gis an algebraic group, then Gis a smooth variety and the inversion morphism
ιG:GGmapping g g1is a morphism of varieties.
Proof. We first show that Gis smooth. The set of regular points of Gis open and
dense, therefore not empty. Let’s assume that Gis regular at g. Given any other point
hG, since the multiplication with the group element hg1is an automorphism of
G, it follows that h=hg1gis also a nonsingular point.
Let Ebe an algebraic monoid and G=G(E). The set Γ:=µ1
E(1)E×Eis
a closed subvariety because it is the preimage of a closed point. For i{1,2}we
denote by pri:E×EEthe projections. Then, G=pr1(Γ)pr2(Γ)is constructible
by [TY05, 15.4.3]. This means that Gis locally closed. Hence, it contains an open
subset Uof its closure Gin E. Because Gacts transitively on itself, we see that
G=gGg.Uis open in G, therefore a variety and consequently, an algebraic group.
Since any algebraic group Gis the unit group of itself, we can maintain the above
notation for the rest of the proof. We now show that ιGis a morphism of varieties.
Note that Γis the graph of the map ιG. Let πi:=pri|Γbe the restrictions of the
projection morphisms to Γ. Each of the πiis a bijective map from a variety to a smooth
variety, so by [TY05, Corollary 17.4.8] they are both isomorphisms of varieties. This
implies that ιG=π1
2π1is a morphism.
A.1.4 Example. Let W
=Cnand E=End(W)
=Cn×n.Eis an algebraic monoid
and the unit group of Eis the general linear group G=GL(W)of invertible linear
maps WW. By choosing a basis of W, we can identify Gwith the group GLnof
invertible complex n×nmatrices. We consider the set V=C[W]dof homogeneous d-
forms on W. It is a finite-dimensional C-vector space, hence a variety. We define the
(right) action αV:V×EVvia αV(P,a):=Pa.
A.1.5 Remark. Let Gbe an algebraic group. If Xis an affine G-variety, we have an
induced G-action on the coordinate ring C[X]by g.ϕ:= (g1)(ϕ) = ϕg1, where
we understand g1:XXas a morphism of varieties. Indeed, for g,hGand
ϕC[X],g.h.ϕ= (g1)((h1)(ϕ)) = (h1g1)(ϕ) = ((gh)1)(ϕ) = gh.ϕ.
The inverse is required to obtain a left action rather than a right action.
The following result will be important to reduce to the connected case:
126
A.1.6 Proposition. Let Gbe an algebraic monoid. There is a unique irreducible com-
ponent GGcontaining 1.Gis a closed submonoid of G.
If Gis an algebraic group, then Gis a normal subgroup and the irreducible
components of Gare the cosets hGfor hG. In particular, the quotient group G/G
is finite.
Proof. Let Gbe a connected component of Gcontaining 1and let Xbe any irreducible
component of G. Then XGis the image of the restriction of µGto X×G. This means
that XGis the image of an irreducible variety under a regular map, therefore XGis
an irreducible, closed subvariety of G. Since 1G, we have XXGXG.Xis
irreducible, so X=XG. This implies X=XGand similarly, we obtain X=GX.
In particular, G=GG, so Gis a closed submonoid. If 1was also contained in X,
then we could similarly conclude that GX=XG=G, so we have X=G.
Assume now that Gis a group. If Xis an irreducible component, then for any
hX, we know that h1Xis the image of an irreducible set under the automorphism
h, so it is an irreducible component. Since 1h1X, we know that h1X=G, hence
X=hG. For any hG, similarly observe that hGh1is a component containing 1,
so hGh1=G. This proves that Gis a normal subgroup. Since its cosets are the
(finitely many) irreducible components of X, this proves the statement.
A.1.1 Quotients by Algebraic Groups
Let Xbe a variety on which an algebraic group Gacts. Then, a categorical quotient
is a morphism π:XYsuch that
(C1) πis constant on G-orbits.
(C2) If ψ:XZis any morphism of varieties which is constant on G-orbits, then
there exists a unique ¯
ψ:YZwith ¯
ψπ=ψ.
If a categorical quotient exists, it is unique up to unique isomorphism and we
denote it by Y=X//G.
However, categorical quotients can lack many properties one would expect from
the quotient by a group action. For example, there are cases when the projection
morphism is not surjective. A much stronger notion is the one of a good quotient,
defined to be a morphism π:XYwith the following properties:
(G1) The morphism πis G-invariant and surjective.
(G2) The comorphism of πinduces an isomorphism OY
=(πOX)G.
(G3) For a closed and G-invariant subset ZX, the image π(Z)is closed in Y.
(G4) Two closed, G-invariant subsets Z1,Z2Xare disjoint if and only if π(Z1)
and π(Z2)are disjoint.
127
A good quotient is called a geometric quotient if the fibers of πare precisely the
G-orbits of X.
A.1.7 Remark. See [TY05, 25.2] for some remarks on Property (G2). The gist is that
for any open subset UY, the ring (πOX)(U) = OX(π1(U)) carries a G-module
structure, so we can consider the corresponding subring of G-invariant functions
OX(π1(U))G. This defines a subsheaf of rings (πOX)G(πOX).
By [New78, §4, Prop. 3.11], any good quotient is also a categorical quotient. If the
quotient is geometric, we denote it by X/G.
A.1.8 Theorem. Let Gbe an affine, reductive algebraic group acting on an affine vari-
ety X. The ring C[X]Gis a finitely generated C-algebra and Y=Spec(C[X]G)together
with the surjective morphism π:XYinduced by C[X]GC[X]defines a good
quotient of Xby G. Furthermore:
(1) If Gis finite and Xirreducible, then this quotient is geometric.
(2) For all xX, the fiber π1(π(x)) contains a unique closed G-orbit . Further-
more, we have π1(π(x)) ={yXG.y}.
Proof. See [TY05, 27.5] and [TY05, 25.5.2].
If a group Gacts on a set X, one can identify the orbit G.xof any element xX
with the set of cosets G/Gxwhere Gx={gG|g.x=x}is the stabilizer of x. It
turns out that under certain assumptions, this remains valid and compatible when X
is a G-variety.
A.1.9 Theorem ([TY05, 21.4]). Let Gbe an affine algebraic group and XaG-variety.
For every point xX, we have:
(1) The orbit G.xis a smooth subvariety of Xwhich is open in its closure G.x.
(2) The stabilizer Gxis a closed subgroup of Gand every component of G.xhas
dimension dim(G)dim(Gx).
(3) G.xis the union of G.xand orbits of strictly smaller dimension.
(4) If Xis affine and Gxis reductive, the right action of Gxon Gby multiplication
admits the good quotient G.x
=G//Gxand in particular G.xis an affine variety
with coordinate ring C[G.x]
=C[G]Gx.
A.1.2 Nullcone and Projective Quotients
If an algebraic group Gacts on a vector space V, the set N:={vV0G.v}
is called the nullcone of this action. Especially when Gis reductive, the geometry
128
of Ncarries a lot of information about the quotient, see [Kra85]. We will require the
following observations:
A.1.10 Lemma. Let Gbe a reductive affine algebraic group acting on a complex vector
space V. Let π:VV//Gbe the quotient and N Vthe nullcone. Then,
N=π1(π(0)), in particular Nis a closed subset of V.
N={vV0G.v}where Gis the connected identity component of G.
Nis the vanishing set of all homogeneous, G-invariant functions in C[V].
Proof. Since the origin 0 is a closed G-orbit, we have 0 G.vif and only if π(v) = π(0)
according to Theorem A.1.8.
For the second item, note that G=H·Gwhere His a finite set, according to
Proposition A.1.6. Since G.v=H.G.v=H.G.v, we can see that 0 G.vif and only
if 0 G.v.
Finally, the defining ideal of π(0)in C[V]Gis equal to the maximal ideal generated
by all homogeneous elements of positive degree, and since πcorresponds to the in-
clusion C[V]GC[V], the set Nis the vanishing set of the same elements, considered
as functions on V.
If Vis a G-module, the projective space P(V)is a G-variety in a natural way. A
linearized projective G-variety is a projective variety Xwith a fixed embedding as
a closed, G-invariant subset XP(V)for some G-module V. We write ˇ
XV
for the affine cone over X, i.e., ˇ
Xis an affine variety which is closed under scalar
multiplication and X=P(ˇ
X).
If Xis a linearized projective G-invariant closed cone, then C[ˇ
X]is a graded alge-
bra. It is the projective coordinate ring of X. The invariant algebra C[ˇ
X]Ginherits this
grading and in particular, one can consider the projective variety Y:=Proj(C[ˇ
X]G)
and the induced rational map π:X99K Y. If we denote by [x]Xthe projective
class of a point xˇ
X, then π([x]) = [ ˇ
π(x)] where ˇ
π:ˇ
Xˇ
X//Gis the affine quo-
tient from Theorem A.1.8. This rational map is undefined when ˇ
π(x) = 0, which by
Lemma A.1.10 means that PNis the indeterminacy locus of π. Hence, Xss :=X\PN
is the open set where πis defined. It is called the set of semistable points of X, with
respect to the (induced) action of Gon X. One also calls ˇ
Xss :=ˇ
X\N the set of
semistable points of ˇ
X. Since Nis a cone, we have Xss = (Pˇ
X)ss =P(ˇ
Xss)and we
sometimes omit the brackets. The restriction π:Xss Yis a morphism.
A.1.11 Proposition. Let Xbe a linearized projective G-variety and Sits projective
coordinate ring. The projective variety Y:=Proj(SG)together with the morphism
π:Xss Yinduced by SGSdefines a good quotient Y=Xss//G.
129
Using Theorem A.1.8, the proof is straightforward. We omit it for brevity and refer
to [Dol03, Thm. 8.1, Prop. 8.1] for a modern treatment and [New78, §4, Thm. 3.14] for
a more classical version. We note the following amplification of Property (C2):
A.1.12 Proposition. Let Xbe a linearized projective G-variety and let ϖ:X99K Ybe
aG-invariant rational map between projective varieties. Then, there exists a rational
map φ:Xss//G99K Zsuch that the following diagram commutes:
Xss
π
ϖY
Xss//G
φ
Here, πis the quotient morphism.
Proof. Fix some embedding of Yin a projective space. Let Sand Rbe the projective
coordinate rings of Xand Y, respectively. The rational map ϖcorresponds to a homo-
morphism ϖ:RSof graded C-algebras and the assumption that ϖis G-invariant
means that the image of ϖis contained in the invariant ring SG. Since SGis the
projective coordinate ring of Xss//G, this homomorphism RSGinduces a rational
map φ:Xss//G99K Y. The inclusion SGSof graded rings corresponds to the
rational map ˜
π:X99K Xss//Gwhich restricts to π, so we have φ˜
π=ϖas rational
maps. Restricting to Xss yields the result.
A.2 Representation Theory of Reductive Groups
Let Gbe an affine algebraic group throughout this section. A Borel subgroup of Gis
a maximal connected solvable subgroup Bof G. It follows from the definition that a
Borel subgroup of Gis closed, because if HGis connected and solvable, then so is
its closure. By [Hum98, 21.3], G/Bis a projective variety and all Borel subgroups of
Gare conjugate. We fix one Borel Bof G.
The only connected, algebraic C-groups of dimension one are the multiplicative
group Gm:=Gm(C):= (C×,·)and the additive group Ga:=Ga(C):= (C,+), see
[TY05, 22.6.2]. A product of multiplicative groups Gr
mis called a torus.
Gcontains a unique largest normal solvable subgroup [TY05, 27.1.1], which is au-
tomatically closed. Its identity component is then the largest connected normal solv-
able subgroup of Gand is called the radical of G, denoted R(G). By [TY05, 27.1.2],
the set of unipotent elements of R(G)is the largest connected normal unipotent sub-
group of G. It is denoted by Ru(G)and we call it the unipotent radical of G. An
affine algebraic group Gis called semisimple (resp. reductive) if R(G)(resp. Ru(G))
is trivial. Note that we follow [TY05, 27.2] here and do not require Gto be connected.
130
However, an algebraic group Ghas a unique, connected, normal subgroup Gcon-
taining the neutral element [TY05, 21.1.5 & 21.1.6] and by definition, Gis semisimple
(resp. reductive) if and only Ghas this property.
For example, each torus is reductive. Furthermore, GLnis reductive, but not
semisimple. However, the subgroup SLnGLnof matrices with determinant 1 is
semisimple.
By [Hum98, 21.3], the maximal tori and the maximal unipotent subgroups of G
are those of the Borel subgroups of G, and all of them are also conjugate. We fix a
maximal torus TBand denote by U=Ru(B)the unipotent radical of B. We have
B=UoTby [Hum98, 19.3].
A.2.1 Example. The affine space Cn×nof all n×nmatrices with entries in Chas the
coordinate ring C[xij i,j[n]]. The group G:=GLnof invertible n×nmatrices is
the set of matrices where detndoes not vanish and therefore a Zariski-open subset
of Cn×n. Hence, Gis an affine algebraic group with coordinate ring C[xij, det1
n].
Denote by S:=SLnthe set of matrices with determinant 1, i.e., the vanishing set of
detn1. It is a closed, affine algebraic subgroup of G.
Denote by Inbe the unit matrix, then it is well-known that the center of G=GLn
is equal to Z:=C×·In. By [Lan02, Part 3, Chapter XIII, 8 & 9], the projective linear
group PGLn=G/Z=S/(SZ)is simple. Thus, if REGis solvable, then R/Z
is also solvable and can only be trivial or equal to PGLn. However, PGLnis not
solvable because Sis perfect (i.e.,[S,S]=S) and therefore not solvable. Thus, R/Zis
trivial, which means RZ. Since Zis connected, normal and solvable, we must have
R(G) = Z. The only unipotent central matrix is the unit matrix, so Ru(G)is trivial
and we have verified that Gis reductive.
A maximal Borel subgroup Bof Gis given by the upper triangular matrices of
nonvanishing determinant and its unipotent radical Uis the subgroup of those ele-
ments where all diagonal entries are equal to 1. The maximal torus Tcorresponding
to this choice are the invertible diagonal matrices.
A.2.1 Representations
Let Gbe an algebraic group. A representation of Gis a homomorphism of algebraic
groups ϱ:GGL(V(ϱ)), where V(ϱ)is some finite-dimensional C-vector space. The
space V(ϱ)becomes a G-variety via the induced action. We also write V
G(ϱ)instead
of V(ϱ)if we want to put emphasis on the group. The degree of ϱis defined to be
dim(V(ϱ)).
Conversely, if Vis some C-vector space which is also a G-variety, we say that Vis
aG-module. We remark that the G-modules are in bijection with the representations
131
of Gby associating to a representation ϱ:GGL(V)the action α:G×VVwhere
α(g,v):=ϱ(g)(v).
Ahomomorphism of G-representations is a C-linear map φ:V
G(ϱ1)V
G(ϱ2)
such that ϱ2(g)φ=φϱ1(g)for all gG. Equivalently, a homomorphism of G-
modules is a C-linear map φ:V1V2where φ(g.x) = g.φ(x)for all gG.
We say that ϱ1is a subrepresentation of ϱ2if there is an injective homomorphism
of G-representations V
G(ϱ1)V
G(ϱ2). Equivalently, a submodule of a G-module
V1is a linear subspace V1V2such that V1is stable under the action of G. A
representation ϱis irreducible if ϱhas no nontrivial subrepresentations. Equivalently
aG-module Vis irreducible if it has no G-invariant linear subspaces other than V
and {0}. Denote by Rep(G)the set of all equivalence classes of G-representations
and by Irr(G)Rep(G)the subset of all irreducible representations. If HGis a
subgroup, we have a canonical restriction map Rep(G)Rep(H),ϱ ϱ|H.
The arguably most important lemma of representation theory is the following:
A.2.2 Lemma (Schurs Lemma). Let Gbe an algebraic group and ϱ1,ϱ2Irr(G). Ev-
ery nonzero homomorphism φ:V(ϱ1)V(ϱ2)of G-representations is an isomor-
phism.
Proof. Note that for vker(φ), we have φ(g.v) = g.φ(v) = g.0 =0, hence ker(φ)is a
G-invariant subspace of V(ϱ1). Since φis nonzero, we must have ker(φ) ={0}.
For two representations ϱ1,ϱ2Rep(G)with Vi:=V(ϱi), we define the repre-
sentation ϱ1ϱ2:GGL(V1V2)via g ϱ1(g)ϱ2(g). For ϱRep(G), let
ϱ:GGL(V
G(ϱ))be the representation defined by ϱ(g):= (ϱ(g))1, i.e., the
group homomorphism corresponding to the action
G×V
G(ϱ) V
G(ϱ)(1)
(g,ϕ) ϕϱ(g)1
A.2.2 Characters, Roots, Weights
Assume henceforth that Gis reductive, we have chosen a Borel BGand a maximal
torus Gr
m
=TB.
A multiplicative (resp. additive) character of an algebraic group His a homomor-
phism of algebraic groups HGm(resp. HGa). Conversely, a multiplicative
(resp. additive) one-parameter-subgroup (also called 1-psg for short) is a homomor-
phism of algebraic groups GmH(resp. GaH). When we say character or
1-psg, we refer to the multiplicative version. The set of characters X(H)and the set
132
of 1-psgs ˇ
X(H)become abelian groups under pointwise multiplication. There are
canonical isomorphisms
Zr X(Gr
m)Zr ˇ
X(Gr
m)(2)
λ χλγ ˇ
χγ
where χλand ˇ
χγare defined as
χλ:Gr
m Gmˇ
χγ:Gm Gr
m
t tλ:=tλ1
1···tλr
rx tγ:= (tγ1, . . . , tγr).
For (γ,λ)ˇ
X(H)×X(H)we define γ,λZto be the integer corresponding to
λγX(Gm)
=Z. In other words, gγ,λ= (gγ)λfor all tC×. It is well-known
[TY05, 22.5.2] that X(T)
=Irr(T)in the sense that any ϱRep(T)decomposes as a
direct sum of weight spaces
V(ϱ)α={vV(ϱ)|tT:t.v=tαv}.
If ϱRep(G), we can also regard V(ϱ)as a T-module and denote by V(ϱ)αthe
corresponding weight space. We call Λ(G):=X(T)the weight lattice of G.
To avoid an introduction to the theory of Lie algebras ([Hum98, Chapter III] or
[TY05, 23]), we deviate from [Hum98] in defining αΛ(G)to be a root of Gwith
respect to Tto be a if there exists an additive 1-psg εα:GaGsuch that
tT:xC:t·εα(x)·t1=εα(tαx).
This definition of root is equivalent to the one given in [Hum98, 16.4]: Indeed, one
direction of this equivalence is stated in [Hum98, 26.3] and the proof actually verifies
the other direction as well.
Denote by Φ:=Φ(G,T)Λ(G)the set of roots of Gwith respect to T. See
[Hum98, 27] for a proof that Φis a so-called abstract root system and [Hum80, III.9.2] or
[TY05, 18] for general facts about abstract root systems. For αΦ, set Uα:=εα(Ga).
We call Φ+:={αΦ|UαB}the positive roots of Gwith respect to B.
A.2.3 Example. We continue Example A.2.1 and keep the notations already estab-
lished. We identify Λ(G) = X(T)with Znin the above way. For i,j[n]and i=j,
denote by αij Znthe vector with the value 1 in position i, the value 1 in position j
and 0 elsewhere. We claim that these weights are the roots of Gwith respect to T.
For i,j[n], we define Eij Cn×nto be the matrix with:
xrs(Eij) = {1 ; (r,s) = (i,j)
0 ; (r,s)= (i,j)
133
With these notations, we define the map
εij :Ga=C G
x In+x·Eij
It is elementary to check that this is a morphism of algebraic groups and furthermore,
for any xCand any diagonal matrix t= (t1, . . . , tn)T,
t·εij(x)·t1=t·(In+x·Eij)·t1=tInt1+x·tEijt1
=In+x·ti
tj·Eij =εij(tαij x).
We therefore know that the αij are certainly roots. With more prelude and the the-
ory of Lie algebras, one can see rather directly that these are, in fact, all the roots.
See [Kra85, Bemerkung 4, III.1.4] or [GW09, Theorem 2.4.1] for a quite elementary
treatment.
The group Uij :=Uαij is now the image of εij and from the definition we can see
that Uij ={In+t·Eij tC}consists of matrices with 1 on the main diagonal and
zeros everywhere else except at position (i,j). Since we chose Bto be the group of
upper triangular matrices, we can see that αij Φ+if and only if i<j.
It can be shown that there exists a (unique) subset Φof linearly independent
weights with the property that for all αΦ+, there are nonnegative integers cδsuch
that α=δcδδ. See [Hum80, III.10.1] or [TY05, 18.7.4] for a proof of this result. A
root belonging to is called a simple root and||is called the rank of G.
For every root αΛ(G) = X(T), there is a unique dual root ˇ
αˇ
X(T)satisfying
the two conditions
ˇ
α,α=2 and
βΦ:βˇ
α,β·αΦ.
This is part of the root system axioms, see [Hum80, III.9.2] or [TY05, 18.2.1]. Note
that the map β βˇ
α,β·αis referred to as σαin [Hum80], as sα,ˇ
αin [TY05] and
as ταIn [Hum98]. It is called the reflection relative to α. It maps αto αand leaves a
certain hyperplane Hαinvariant. We then call
Λ+(G):={λΛ(G)αΦ+:ˇ
α,λ0}
the dominant weights of G, the notation being justified in that they lie on the side of
Hαwhich we have marked as “positive” by our choice of Φ+, which in turn is based
on the choice of B. See [TY05, 18.11.7] for a proof that Λ+(G)is a finitely generated
semigroup.
134
A.2.4 Example. We continue Example A.2.3. For the set , we choose in this case the
roots αi:=αi,(i+1)for i[n1]. There are n1 of these roots, they are linearly
independent and for i<j, we have
αij =αi+αi+1+···+αj1.
The rank of GLnis therefore n1.
Given any root αZn
=X(T), we define its dual to be the 1-psg corresponding to
the same tuple via (2). With this definition,ˇ
α,αcorresponds simply to the ordinary
scalar product Zn×ZnZgiven by (α,β) αtβ. We identify αand ˇ
α. Clearly,
αij,αij=αijtαij =2.
Let λZn. For i<j, we have λiλjif and only ifαij,λ=λiλj0. Hence,
the dominant weights of GLnare precisely the weakly descending vectors of integer
numbers such a vector is called a generalized partition.
We remark that in this case, it is easy to see that the semigroup Λ+(GLn)is finitely
generated: The vectors
ω1:= (1, 0,0, . . . , 0),
ω2:= (1, 1,0, . . . , 0),
.
.
.
ωn:= (1, 1,1, . . . , 1)
(3)
form a generating set together with ωn.
We denote by Λr(G)the N-span of Φ+. The group Λr(G)generated by Λr(G)is
equal to the Z-span of all roots and is sometimes called the root lattice of G.
For two characters λ,µX(T)we write λDµif λµΛr(G).
A.2.5 Example. Still continuing Example A.2.4, we remark finally that the root lattice
of G=GLnconsists of all λZnwith the property that the sum of all entries
vanishes, i.e.,|λ|:=λ1+···+λn=0. Furthermore,
Λr(G) ={λZn|k[n]:λ1+···+λk0,|λ|=0}.
Indeed, if λ=n1
k=1ckαkis a sum of simple roots with ckZ, then λk=ckck1.
Hence, ckNfor all kNif and only if
λ1+···+λk=c1+ (c2c1) + ···+ (ckck1) = ck0.
For dominant weights λ,µΛ+(G), the dominance order is in agreement with
the following combinatorial definition of dominance order for integer partitions: Two
integer partitions λ= (λ1, . . . , λn)and µ= (µ1, . . . , µn)satisfy λDµif and only if
|λ|=|µ|and for all kn, we have λ1+···+λkµ1+···+µk.
135
Finally, we sum up the characterization of the irreducible representations of Gin
the following theorem:
A.2.6 Theorem. Let ϱIrr(G)and V:=V(ϱ). There exists a unique dominant
weight λΛ+(G)such that:
(1) As a T-module, V=µEλVµ.
(2) dim(Vλ) = 1. Elements of Vλare called highest weight vectors.
(3) Uacts trivially on Vµif and only if µ=λ.
(4) For αΦand µΛ(G), we have Uα.VµdNVµ+dα.
This defines a bijection Irr(G)
=Λ+(G). We henceforth identify the two.
Proof. The main statement as well as (1) to (3) can be found in [Hum98, Theorem 31.3]
together with the statement [Hum98, 31.4] which proves that each dominant weight
has a corresponding irreducible representation. Part (4) is [Hum98, 27.2].
For ϱRep(G), the module V:=V(ϱ)decomposes as
V=
λΛ
+(G)
V(λ)
where V(λ)
=V(λ)nλfor certain nλN. We call V(λ)the isotypical component of
weight λof V. When fV(λ), we call wt(f):=λthe weight of f.
A.2.3 The Coordinate Ring of an Algebraic Group
We recite a famous, general result about the weights that appear in the coordinate
ring of a reductive, affine, algebraic group:
A.2.7 Theorem (Algebraic Peter-Weyl Theorem). Let Gbe a reductive, affine, alge-
braic group. The group G×Gacts on Gfrom the left via (g,h).x=gxh1. Consider
ψ:
λIrr(G)
V(λ)V(λ) C[G]
fv fωv
where ωv:GV(λ)is the morphism g g.v.
Then, ψis an isomorphism of G×G modules, where the action of the left and
right factor of G×Gis on the left and right tensor factor, respectively.
Proof. See [TY05, 27.3.9]. We give a quick remark about the action. Recall from (1)
that gGacts on fV(λ)by g.f=fg1and recall from Remark A.1.5 that the
induced action on the coordinate ring of Gis such that for ϕC[G]and g,h,xG
136
we have ((g,h).ϕ)(x) = ϕ(g1xh). Let λIrr(G),fV(λ),vV(λ)and g,hG.
Then, the following holds for all xG:
((g,h).ψ(fv))(x) = ((g,h).(fωv))(x) = f(ωv(g1xh)) = f(g1xh.v)
= (fg1)(x.(h.v)) = (( fg1)αh.v)(x) = ψ((g.f)(h.v))(x).
Hence, (g,h).ψ(fv) = ψ((g.f)(h.v)) as claimed.
A.2.8. We summarize Examples A.2.1 and A.2.3 to A.2.5 as follows: The dominant
weights for the reductive group GLnare the generalized partitions
Λ+
n:={λZn|λ1 ··· λn}.
We denote by V(λ)the irreducible GLn-module corresponding to the highest weight
λΛ+
n. Every GLn-module Vdecomposes as V=λΛ
+
nV(λ)nλfor certain nλ
N.
A.2.9 Proposition. The coordinate ring of GLnCn×nhas a natural Z-grading be-
cause it is the localization of a polynomial ring at the homogeneous polynomial detn.
For dZ, the isomorphism from Theorem A.2.7 restricts to
C[GLn]d
=λΛ
+
n
|λ|=d
V(λ)V(λ).
Proof. Let λΛ+
n,d:=|λ|and fvV(λ)V(λ). By Theorem A.2.7 we only
have to show that fωvis homogeneous of degree d. Given tC×, Theorem A.2.6
implies that δt:=diag(t, . . . , t)acts on a vector vV(λ)as δt.v=td·vbecause for
all µEλ, we have|µ|=|λ|=d(Example A.2.5). Hence,
f(ωv(t·g)) = f(ωv(gδt)) = f(g.δt.v) = f(g.(td·v)) = f(td·(g.v))
=td·f(g.v) = td·f(ωv(g)).
Note that the fourth and fifth equality are due to the fact that v g.vand fare both
linear maps.
The module V(λ)is also an irreducible module. More precisely, for any dominant
weight λ= (λ1, . . . , λn)Λ+
nwe define λ:= (λn, . . . , λ1)Λ+
n. Then,
A.2.10 Proposition. Let λΛ+
n. Then, V(λ)=V(λ)is an irreducible module.
Proof. This follows from Theorem A.2.6 and (1).
137
A.2.11. Proposition A.2.10 is a special case of a general principle. We denote by Bthe
set of all Borel subgroups of G. By [Hum98, Proposition A in 24.1], the Normalizer
N:=NG(T)of Tpermutes the elements of Btransitively. For G=GLnand T
the subgroup of diagonal invertible matrices, note that Nis the semidirect product
SnnT, where the permutation group Snis embedded in GLnas the subgroup of
permutation matrices.
The induced action of the Weyl group S:=N/Tis free, so Sparametrizes exactly
the possible choices of a base for the root system with respect to the torus T. The
action of Son Bis given by B Bq:=q1Bq for qN.
Let τNbe such that BBτ=T. Note that in the case of G=GLn, this is simply
the permutation that maps (1, . . . , n)to the reverse vector (n, . . . , 1), because Bis the
set of upper triangular matrices and BBτ=Tif and only if Bτis the set of lower
triangular matrices.
The image of τin Sis called the longest Weyl element. In the literature, it is often
denoted by w0. With the terminology from [Spr08], w1
0has the same length as w0and
by [Spr08, Corollary 8.3.11], an element with this property is unique, hence w0=w1
0.
It follows that τ=τ1.
The Weyl group acts on the weights X(T)as follows: For any χX(T)and qN,
we define the character q.λvia the rule (q.λ)(t):=λ(qtq1). Then, the longest Weyl
element maps highest weights to highest weights [Hum98, 31.6]. More precisely, we
have V(λ)=V(τ.λ). In the case of G=GLn, we have τ.λ=λ.
In particular:
A.2.12 Proposition. Let Vbe a G-module. Then, VG
=(V)G.
Proof. By Theorem A.2.6,V=λΛ
+(G)V(λ), so VG=V(0). By the above Para-
graph A.2.11,V=λΛ
+(G)V(τ.λ)where τdenotes the longest Weyl element.
Since τ.λ=0 is equivalent to λ=0, we have (V)G
=VG.
We end by proving a Lemma that we will require in the next section and which is
of independent interest:
A.2.13 Lemma. Let Vbe a G-module (or more generally, a factorial G-variety) and
fC[V]. Assume that f=fk1
1··· fkr
ris the decomposition of finto irreducible
factors. Then, fis a highest weight vector of weight λif and only if the fiare highest
weight vectors of weight λiwith λ=k1λ1+···+krλr.
Proof. If the fiare weight vectors, then fis still U-invariant and it is easy to see that
the weight of fmust be equal to k1λ1+···+krλr.
For the converse, assume that fis a highest weight vector of weight λand denote
by Z:=Z(f)Vthe vanishing set of f. Note that Zis B-invariant, so we have
138
an action α:B×ZZinduced by the action of Bon V. Let Zi:=Z(fi), then
Z=Z1. . . Zris the decomposition of Zinto its irreducible components. Let
αi:B×ZiZbe the restriction of αto Zi. Since Bis connected, B×Ziis irreducible
and Ziis contained in the image of αi therefore, B.Zi=αi(B×Zi) = Zi. Hence,
the Ziare invariant under B. It follows that B.fiC·fiso Bacts by a character on fi.
This means that fiis a highest weight vector of some weight λi. Consequently, we
must have λ=k1λ1+···+krλr.
A.3 Polynomial Representations
Let W
=Cnbe a complex vector space with a chosen basis. The action of GLnon a
module V
=CNis a morphism ϱ: GLnGLNof algebraic groups. Hence, it can
be described by N2regular functions on GLn. In many cases we are interested in,
these functions are polynomials in the entries of an n×nmatrix, i.e., ϱextends to a
morphism EndnEndNof affine spaces. In other words, the action of GLnextends
to an action V×EndnV. We call such a representation a polynomial one.
A.3.1 Definition. There is a partial ordering on Λ+
n, called the inclusion, defined
as follows: We have µλif and only if µiλifor all 1 in.
Note that µλimplies µEλ, but not the converse. We have
Λ+
n:={λΛ+
nλ0}={λZn|λ1. . . λn0}Nn,
the set of partitions of the number n. We will show:
A.3.2 Proposition. Let λΛ+
n, then V(λ)is polynomial if and only if λΛ+
n.
When Vis any GL(W)-module, we can decompose it as V
=λΛV(λ)nλfor
certain nλN. We then write
V0:=
λΛ
+
n
λ0
V(λ)nλV.
We call V0the polynomial part of V. It then follows:
A.3.3 Corollary. A GLn-module Vis polynomial if and only if V=V0.
We require an auxiliary lemma for the proof of Proposition A.3.2.
A.3.4 Lemma. The inclusion GLnEndnis an open, GLn-equivariant immersion of
varieties under the operation of GLnacting by multiplication from the left on both
affine varieties. It induces an inclusion of their respective coordinate rings which
satisfies C[Endn] = C[GLn]0=λΛ
+
nV(λ)V(λ).
139
Proof. By [Lan12, Formula (6.5.1)], we get the last equality in
C[End(W)]d
=C[WW]d
=Symd(WW)
=
0λΛ
+
n
λ1+···+λn=d
V(λ)V(λ).
By summing over dand applying Theorem A.2.7, we obtain
C[End(W)]
=
λΛ
+
n
V(λ)V(λ)=C[GL(W)]0
by the definition of the polynomial part.
Proof of Proposition A.3.2.By Theorem A.2.7, the module C[GLn]contains a unique
highest weight vector of weight µfor every µΛ+
n, up to a scalar. Let fbe the
highest weight vector of weight λ= (λ1, . . . , λn). By Lemma A.3.4, we have λΛ+
n
if and only if fextends to a regular function on Endn, i.e., it is a polynomial. Let
f=q·detk
nwith kZand qC[Endn]not divisible by detn. We have to show that
k0 is equivalent to λn0. In fact, we show that k=λn.
Denote by δ:= (1, . . . , 1)Λ+
n, then wt(detn) = δ. We claim that µ:=wt(q)
satisfies µn=0. Indeed, otherwise we’dhave µδΛ+
nand Lemma A.2.13 would
imply that qis divisible by the (unique) weight vector of weight δ, which is detn.
Consequently, λn=µn+k=k.
140
Bibliography
[ABV15] Jarod Alper, Tristram Bogart, and Mauricio Velasco. “A Lower Bound for
the Determinantal Complexity of a Hypersurface”. In: Foundations of Com-
putational Mathematics (2015), pp. 1–8.
[AM69] Michael Francis Atiyah and Ian Grant Macdonald. Introduction to Commu-
tative Algebra. Addison-Wesley, 1969.
[AT92] Noga Alon and Michael Tarsi. “Colorings and orientations of graphs”. In:
Combinatorica 12.2 (1992), pp. 125–134.
[Atk83] M. D. Atkinson. “Primitive spaces of matrices of bounded rank. II”. In: J.
Austral. Math. Soc. Ser. A 34.3 (1983), pp. 306–315.
[AYY13] Jinpeng An, Jiu-Kang Yu, and Jun Yu. “On the dimension datum of a
subgroup and its application to isospectral manifolds”. In: J. Differential
Geom. 94.1 (2013), pp. 59–85. url:http://projecteuclid.org/euclid.
jdg/1361889061.
[BCI11] Peter Bürgisser, Matthias Christandl, and Christian Ikenmeyer. “Even par-
titions in plethysms”. In: J. Algebra 328 (2011), pp. 322–329.
[BCS97] Peter Bürgisser, Michael Clausen, and M. Amin Shokrollahi. Algebraic com-
plexity theory. Vol. 315. Grundlehren der Mathematischen Wissenschaften
[Fundamental Principles of Mathematical Sciences]. With the collaboration
of Thomas Lickteig. Springer-Verlag, Berlin, 1997, pp. xxiv+618.
[BD06] Matthias Bürgin and Jan Draisma. “The Hilbert null-cone on tuples of
matrices and bilinear forms”. In: Math. Z. 254.4 (2006), pp. 785–809.
[BI11] Peter Bürgisser and Christian Ikenmeyer. “Geometric complexity theory
and tensor rank”. In: STOC’11—Proceedings of the 43rd ACM Symposium on
Theory of Computing. extended abstract. ACM, New York, 2011, pp. 509–
518.
141
[BI15] Peter Bürgisser and Christian Ikenmeyer. “Fundamental invariants of orbit
closures”. To appear in J. Algebra. 2015. url:http://arxiv.org/abs/
1511.02927.
[BIH17] Peter Bürgisser, Christian Ikenmeyer, and Jesko Hüttenhain. “Permanent
Versus Determinant: Not Via Saturations”. In: Proc. AMS 145 (July 2017),
pp. 1247–1258.
[BIP16] Peter Bürgisser, Christian Ikenmeyer, and Greta Panova. “No Occur-
rence Obstructions in Geometric Complexity Theory”. arχiv:1604.06431,
accepted for Publication in FOCS 2016. 2016.
[BKT12] Franck Butelle, Ronald King, and Fréric Toumazet. SCHUR. Version 6.07.
Mar. 31, 2012. url:http://sourceforge.net/projects/schur/.
[BL04] Peter Bürgisser and Martin Lotz. “Lower bounds on the bounded coeffi-
cient complexity of bilinear maps”. In: J. ACM 51.3 (2004), 464–482 (elec-
tronic).
[Bor+16] A. Boralevi et al. “Uniform determinantal representations”. 2016. url:
https://arxiv.org/abs/1607.04873.
[BOR09] Emmanuel Briand, Rosa Orellana, and Mercedes Rosas. “Reduced Kro-
necker coefficients and counter-examples to Mulmuley’s saturation con-
jecture SH”. In: Computational Complexity 18.4 (2009), pp. 577–600.
[Bot67] Peter Botta. “Linear transformations that preserve the permanent”. In:
Proc. Amer. Math. Soc. 18 (1967), pp. 566–569.
[Bri11] Michel Brion. “Invariant Hilbert schemes”. 2011. url:https:/ /arxiv.
org/abs/1102.0198.
[Bri87] Michel Brion. “Sur l’image de l’application moment”. In: Séminaire
d’algèbre Paul Dubreil et Marie-Paule Malliavin (Paris, 1986). Vol. 1296.
Lecture Notes in Math. Springer, Berlin, 1987, pp. 177–192.
[Bri93] Michel Brion. “Stable properties of plethysm: on two conjectures of
Foulkes”. In: Manuscripta Math. 80.4 (1993), pp. 347–371.
[Bür+11] Peter Bürgisser et al. “An overview of mathematical issues arising in the
geometric complexity theory approach to VP =VNP”. In: SIAM J. Comput.
40.4 (2011), pp. 1179–1209.
[Bü00] Peter Bürgisser. Completeness and reduction in algebraic complexity theory.
Vol. 7. Algorithms and Computation in Mathematics. Springer-Verlag,
Berlin, 2000, pp. xii+168.
142
[CKW10] Xi Chen, Neeraj Kayal, and Avi Wigderson. “Partial derivatives in arith-
metic complexity and beyond”. In: Found. Trends Theor. Comput. Sci. 6.1-2
(2010), front matter, 1–138 (2011).
[Coo71] Stephen A. Cook. “The Complexity of Theorem-proving Procedures”. In:
Proceedings of the Third Annual ACM Symposium on Theory of Computing.
STOC ’71. New York, NY, USA: ACM, 1971, pp. 151–158.
[DK02] Harm Derksen and Gregor Kemper. Computational invariant theory. Invari-
ant Theory and Algebraic Transformation Groups, I. Encyclopaedia of
Mathematical Sciences, 130. Springer-Verlag, Berlin, 2002, pp. x+268.
[Dol03] Igor Dolgachev. Lectures on Invariant Theory. Cambridge University Press,
2003.
[Dri98] Arthur Arnold Drisko. “Proof of the Alon-Tarsi conjecture for n=2rp”.
In: Electron. J. Combin. 5 (1998), Research paper 28, 5 pp. (electronic). url:
http://www.combinatorics.org/Volume_5/Abstracts/v5i1r28.html.
[EH00] David Eisenbud and Joe Harris. The geometry of schemes. Vol. 197. Graduate
Texts in Mathematics. Springer-Verlag, New York, 2000, pp. x+294.
[EH88] David Eisenbud and Joe Harris. “Vector spaces of matrices of low rank”.
In: Adv. in Math. 70.2 (1988), pp. 135–155.
[Eis94] David Eisenbud. Commutative Algebra with a View Toward Algebraic Geome-
try. New York: Springer, 1994.
[EZ62] H. Ehlich and K. Zeller. “Binäre Matrizen”. In: Zeitschrift für Angewandte
Mathematik und Mechanik 42.S1 (1962), T20–T21.
[FH04] William Fulton and Joe Harris. Representation Theory A First Course. fifth.
Springer, 2004.
[FLR85] P. Fillmore, C. Laurie, and H. Radjavi. “On matrix spaces with zero deter-
minant”. In: Linear and Multilinear Algebra 18.3 (1985), pp. 255–266.
[For09] Lance Fortnow. “The Status of the P Versus NP Problem”. In: Commun.
ACM 52.9 (Sept. 2009), pp. 78–86.
[Fro97] Ferdinand Georg Frobenius. “Über die Darstellung der endlichen Gruppe
durch lineare Substitutionen”. In: Sitzungsberichte der Königlich Preussischen
Akademie Der Wissenschaften zu Berlin (1897).
[Ful97] William Fulton. Young tableaux. Vol. 35. London Mathematical Society Stu-
dent Texts. With applications to representation theory and geometry. Cam-
bridge University Press, Cambridge, 1997, pp. x+260.
143
[GKZ94] Israïl Moyseyovich Gel’fand, Mikhail M. Kapranov, and Andrei Vladlen-
ovich Zelevinsky. Discriminants, resultants, and multidimensional deter-
minants. Mathematics: Theory & Applications. Birkhäuser Boston Inc.,
Boston MA, 1994, pp. x+523.
[Gly10] David Gerald Glynn. “The conjectures of Alon-Tarsi and Rota in dimen-
sion prime minus one”. In: SIAM J. Discrete Math. 24.2 (2010), pp. 394–399.
[Gre11] B. Grenet. “An upper bound for the permanent versus determinant prob-
lem”. In: (2011). manuscript.
[Gro67] A. Grothendieck. “Éléments de géométrie algébrique. IV. Étude locale des
schémas et des morphismes de schémas IV”. In: Inst. Hautes Études Sci.
Publ. Math. 32 (1967), p. 361.
[GW09] Roe Goodman and Nolan Russell Wallach. Symmetry, Representations, and
Invariants. Springer, 2009.
[GW10] Ulrich Görtz and Torsten Wedhorn. Algebraic Geometry I. Wiesbaden:
Vieweg + Teubner, 2010.
[Har06] Robin Hartshorne. Algebraic Geometry. New York: Springer, 2006.
[Har95] Joe Harris. Algebraic Geometry A First Course. Springer, 1995.
[HI16] J. Hüttenhain and C. Ikenmeyer. “Binary Determinantal Complexity”. In:
Linear Algebra and its Applications 504 (2016), pp. 559 –573.
[HL16] J. Hüttenhain and P. Lairez. “The Boundary of the Orbit of the 3-by-3 De-
terminant Polynomial”. In: Comptes rendus Mathematique 354.9 (Sept. 2016),
pp. 931–935.
[Hum80] James Edward Humphreys. Introduction to Lie Algebras and Representation
Theory. 3rd. Springer, 1980.
[Hum98] James Edward Humphreys. Linear Algebraic Groups. Springer, 1998.
[Hü12] J. Hüttenhain. “From Sylvester-Gallai Configurations to Branched Cover-
ings”. Diplomarbeit. Universität Bonn, 2012.
[Hü17] Jesko Hüttenhain. “A Note on Normalizations of Orbit Closures”. In: Com-
munications in Algebra 45.9 (Nov. 2017), pp. 3716–3723.
[IK99] Anthony Iarrobino and Vassil Kanev. Power sums, Gorenstein algebras, and
determinantal loci. Vol. 1721. Lecture Notes in Mathematics. Appendix
C by Iarrobino and Steven L. Kleiman. Springer-Verlag, Berlin, 1999,
pp. xxxii+345.
144
[Ike12] Christian Ikenmeyer. “Geometric Complexity Theory, Tensor Rank, and
Littlewood-Richardson Coefficients”. PhD thesis. Germany: Universität
Paderborn, 2012.
[Ike16] Christian Ikenmeyer. Private communication. 2016.
[IP16] Christian Ikenmeyer and Greta Panova. “Rectangular Kronecker Coef-
ficients and Plethysms in Geometric Complexity”. arχiv:1512.03798, ac-
cepted for Publication in FOCS 2016. 2016.
[Kay12] Neeraj Kayal. “Affine projections of polynomials [extended abstract]”. In:
STOC’12—Proceedings of the 2012 ACM Symposium on Theory of Computing.
ACM, New York, 2012, pp. 643–661.
[KM02] Irina A. Kogan and Marc Moreno Maza. “Computation of canonical forms
for ternary cubics”. In: Proceedings of the 2002 International Symposium on
Symbolic and Algebraic Computation. ACM, New York, 2002, 151–160 (elec-
tronic).
[Knu98] Donald Ervin Knuth. The art of computer programming. Vol. 2. Seminu-
merical algorithms, Third edition. Addison-Wesley, Reading, MA, 1998,
pp. xiv+762.
[Koi04] Pascal Koiran. “Valiant’s model and the cost of computing integers”. In:
Comput. Complexity 13.3-4 (2004), pp. 131–146.
[Kra85] Hanspeter Kraft. Geometrische Methoden in der Invariantentheorie. 2nd.
Vieweg & Sohn Verlagsgesellschaft, 1985.
[Kum15] Shrawan Kumar. “A study of the representations supported by the or-
bit closure of the determinant”. In: Compositio Mathematica 151.2 (2015),
pp. 292–312.
[Lan02] Serge Lang. Algebra (Third Edition). Springer, 2002.
[Lan12] Joseph M. Landsberg. Tensors: geometry and applications. Vol. 128. Graduate
Studies in Mathematics. Providence, RI: American Mathematical Society,
2012.
[Lan15] Joseph Montague Landsberg. “Geometric complexity theory: an introduc-
tion for geometers”. In: Ann. Univ. Ferrara. Sez. VII Sci. Mat. 61.1 (2015),
pp. 65–117.
[Li09] Li Li. “Wonderful compactification of an arrangement of subvarieties”. In:
Michigan Math. J. 58.2 (2009), pp. 535–563.
[LL89] Thomas Lehmkuhl and Thomas Lickteig. “On the order of approximation
in approximative triadic decompositions of tensors”. In: Theoretical Com-
puter Science 66.1 (1989), pp. 1–14.
145
[LMR13] Joseph Montague Landsberg, Laurent Manivel, and Nicolas Ressayre.
“Hypersurfaces with degenerate duals and the geometric complexity
theory program”. In: Comment. Math. Helv. 88.2 (2013), pp. 469–484.
[LP90] M. Larsen and R. Pink. “Determining representations from invariant di-
mensions”. In: Invent. Math. 102.2 (1990), pp. 377–398.
[Mal03] Guillaume Malod. “Polynômes et coefficients”. PhD thesis. L’Université
Claude Bernard Lyon 1, 2003.
[Mar73] Marvin Marcus. Finite dimensional multilinear algebra. Part 1. Pure and
Applied Mathematics, Vol. 23. Marcel Dekker, Inc., New York, 1973,
pp. x+292.
[MFK94] David Mumford, J. Fogarty, and F. Kirwan. Geometric invariant theory.
3rd ed. Vol. 34. Ergebnisse der Mathematik und ihrer Grenzgebiete (2).
Springer-Verlag, Berlin, 1994.
[MM15] Laurent Manivel and Mateusz Michałek. “Secants of minuscule and comi-
nuscule minimal orbits”. In: Linear Algebra Appl. 481 (2015), pp. 288–312.
[MM62] Marvin Marcus and F. C. May. “The permanent function”. In: Canad. J.
Math. 14 (1962), pp. 177–189.
[Mor73] Jacques Morgenstern. “Note on a lower bound of the linear complexity of
the fast Fourier transform”. In: J. Assoc. Comput. Mach. 20 (1973), pp. 305–
306.
[MP08] Guillaume Malod and Natacha Portier. “Characterizing Valiant’s algebraic
complexity classes”. In: J. Complexity 24.1 (2008), pp. 16–38.
[MP13] B. D. McKay and A. Piperno. “Practical Graph Isomorphism, II”. In: J.
Symbolic Computation 60 (2013), pp. 94–112.
[MR04] Thierry Mignon and Nicolas Ressayre. “A quadratic bound for the de-
terminant and permanent problem”. In: Int. Math. Res. Not. 79 (2004),
pp. 4241–4253.
[MS01] Ketan D. Mulmuley and Milind Sohoni. “Geometric complexity theory. I.
An approach to the P vs. NP and related problems”. In: SIAM J. Comput.
31.2 (2001), pp. 496–526.
[MS05] Erza Miller and Bernd Sturmfels. Combinatorial Commutative Algebra.
Springer, 2005.
[MS08] Ketan D. Mulmuley and Milind Sohoni. “Geometric complexity theory. II.
Towards explicit obstructions for embeddings among class varieties”. In:
SIAM J. Comput. 38.3 (2008), pp. 1175–1206.
146
[Mul07] Ketan D. Mulmuley. Geometric complexity theory VI: the flip via saturated and
positive integer programming in representation theory and algebraic geometry.
Tech. rep. TR–2007–04. Computer Science Department, The University of
Chicago, May 2007.
[MV97] Meena Mahajan and V. Vinay. “Determinant: combinatorics, algorithms,
and complexity”. In: Chicago J. Theoret. Comput. Sci. (1997), Article 5, 26 pp.
(electronic).
[New78] P. E. Newstead. Introduction to moduli problems and orbit spaces. Vol. 51. Tata
Institute of Fundamental Research Lectures on Mathematics and Physics.
Tata Institute of Fundamental Research, Bombay; by the Narosa Publishing
House, New Delhi, 1978, pp. vi+183.
[Pro06] Claudio Procesi. Lie Groups An Approach through Invariants and Represen-
tations. Springer, 2006.
[Pyt] Python. Version 3.5.2. Programming Language. Python Software Founda-
tion, July 5, 2016. url:http://www.python.org/.
[Raz03] Ran Raz. “On the complexity of matrix product”. In: SIAM J. Comput. 32.5
(2003), 1356–1369 (electronic).
[Rei16] Philipp Reichenbach. “Stabilisator der Determinante und maximal lineare
Teilräume”. Bachelor Thesis. Germany: TU Berlin, 2016.
[RR97] Alexander A. Razborov and Steven Rudich. “Natural proofs”. In: J. Com-
put. System Sci. 55.1, part 1 (1997). 26th Annual ACM Symposium on the
Theory of Computing (STOC’94) (Montreal, PQ, 1994), pp. 24–35.
[Rys63] Herbert John Ryser. Combinatorial mathematics. The Carus Mathematical
Monographs, No. 14. Distributed by John Wiley and Sons, Inc., New York.
Mathematical Association of America, 1963, pp. xiv+154.
[Sha94] Igor Shafarevich. Basic Algebraic Geometry 1. 2nd. Berlin: Springer, 1994.
[Slo11] N. Sloane. The On-Line Encyclopedia of Integer Sequences, Sequence A003432.
2011. url:http://oeis.org/A003432.
[Spr08] Tonny Albert Springer. Linear Algebraic Groups. Springer, 2008.
[Stu93] Bernd Sturmfels. Algorithms in invariant theory. Texts and Monographs in
Symbolic Computation. Springer-Verlag, Vienna, 1993, pp. vi+197.
[Sym] SymPy. Version 1.0. Software Library. SymPy Development Team, Mar. 10,
2016. url:http://www.sympy.org/.
147
[Tod92] Seinosuke Toda. “Classes of Arithmetic Circuits Capturing the Complex-
ity of the Determinant”. In: IEICE TRANS. INF. & SYST. E75-D.1 (1992),
pp. 116–124.
[TY05] Patrice Tauvel and Rupert W. T. Yu. Lie Algebras and Algebraic Groups.
Springer, 2005.
[Val79a] Leslie G. Valiant. “Completeness Classes in Algebra”. In: Proceedings of the
11h Annual ACM Symposium on Theory of Computing, April 30 - May 2, 1979,
Atlanta, Georgia, USA. 1979, pp. 249–261.
[Val79b] Leslie G. Valiant. “The Complexity of Computing the Permanent”. In:
Theor. Comput. Sci. 8 (1979), pp. 189–201.
[Yu16] Jun Yu. On the dimension datum of a subgroup. Duke Math. J., Advance Pub-
lication, 1st July. 2016.
148
List of Symbols
Inclusion order on generalized partitions (page 137).
Equivalence of complexity measures (page 36).
(P)Row vector of partial derivatives of P(page 59).
[m]The set of natural numbers from 1 to m(page 16).
X//GQuotient of a G-variety Xby G(page 125).
PQThe form Pis in the GL-orbit of the form Q. (page 108).
αXaction morphism of a variety with semigroup action (page 123).
AAdjugate of A, transpose of the cofactor matrix (page 80).
ˆ
APAnnihilator scheme (page 82).
ˆ
Ass
PSemistable part of ˆ
AP(page 82).
AtTranspose of the matrix A(page 41).
APAnnihilator of a form P(page 61).
bdc(P)Binary determinantal complexity of a polynomial P(page 21).
bndThe generic binomial (page 105).
cc(P)Circuit complexity of a polynomial Pwith coefficients in a fixed ring
(usually a field). (page 10).
ccR(P)Circuit complexity of a polynomial Pwith coefficients in R(page 10).
Cdd-th Chow variety, orbit closure of mnd=x1x. . . xd(page 49).
coneQ(S) Rational cone generated by a subset SZn(page 47).
CJtKThe ring of formal power series with complex coefficients (page 77).
C[X]Coordinate ring of an affine variety X(page 123).
DdThe orbit closure of detd(page 45).
DϱDifferential of a morphism ϱbetween varieties (page 79).
149
iPThe i-th partial derivative of a polynomial P(page 58).
wPPartial derivative of a form Pin direction w(page 58).
Pd,mThe orbit closure of ppd,m(page 45).
dc(P)Determinantal complexity of a polynomial P(page 18).
dc(P)Determinantal border complexity of a polynomial P(page 37).
detdThe determinant polynomial of a generic d×dmatrix (page 16).
doc(P)Determinantal orbit complexity of a polynomial P(page 36).
doc(P)Determinantal orbit border complexity of a polynomial P(page 37).
dom(ϖ)Domain of the rational map ϖ(page 81).
End(W)The linear endomorphisms of a vector space W(page 35).
End(W,L)Endomorphisms on Wthat map into the subspace LW(page 89).
GIdentity component of an algebraic group G(page 125).
Γ(φ)Graph of a rational map φ(page 81).
GPStabilizer of a homogeneous polynomial P(page 41).
GaThe additive group (page 128).
GL(W)The general linear group of invertible endomorphisms of W(page 124).
GLnThe group of invertible, complex n×nmatrices (page 124).
glnLie algebra of GLn(page 79).
gl(V)Lie algebra of GL(V)(page 79).
GmThe multiplicaive group (page 128).
HCmHamilton Cycle polynomial (page 31).
InThe n×nidentity matrix (page 47).
idWThe identity map on W(page 59).
im(a)Image of an endomorphism a(page 63).
Irr(G)All equivalence classes of irreducible G-representations (page 130).
|λ|Sum of all entries of an integer vector λZn(page 46).
(λ)Length of a partition λ(page 46).
LUsually a linear subspace of W, often a linear subspace of some hyper-
surface Z(P)W(page 61).
λNegative reverse of a parttion λ(page 135).
Λ+(G)Dominant weights of an affine algebraic group G(page 132).
150
Λ+(Z)Dominant weights in the coordinate ring of a GLn-variety Z(page 46).
Λ(G)Weight lattice of an affine algebraic group G(page 131).
Lie(G)Lie algebra of an algebraic group G(page 79).
Λ+
nGeneralized partitions, the dominant weights of GLn(page 135).
Λ+
nDominant weights of GLnthat yield polynomial representations, parti-
tions of n(page 137).
Λ(Z)Group generated by Λ+(Z)(page 47).
mndThe universal monomial mnd=x1···xd(page 49).
NPNullcone of the action of the stabilizer group on the space of Endomor-
phisms (page 62).
N(Z)Normalization of a variety Z(page 48).
P,(P)Orbit of a homogeneous polynomial (page 39).
ocP(λ)Orbit coefficient of Pat λ(page 42).
P,(P)Orbit closure of a homogeneous polynomial (page 39).
ocP(λ)Orbit closure coefficient of Pat λ(page 40).
PBoundary of the orbit of a homogeneous form (page 60).
PUsually a set of polynomials (page 35).
ΦUsually a root system (page 131).
permThe permanent polynomial of a generic m×mmatrix (page 16).
PQ P is a projection of Qover a fixed ring (page 14).
PQThe two polynomials Pand Qare equivalent over a fixed ring (page 35).
PRQ P is a projection of Qover the ring R(page 14).
PRQThe two polynomials Pand Qare equivalent over the ring R(page 35).
poly The set of polynomially bounded functions NN. (page 12).
poly(n)Expressions that are polynomially bounded in nN. (page 12).
ppd,mPadded permanent polynomial (page 39).
PσThe polynomial that arises from Pby the substitution σ(page 14).
QUsually a set of polynomials (page 35).
Rep(G)All equivalence classes of G-representations (page 130).
R(G)Radical of an algebraic group G(page 128).
rk(a)Rank of an endomorphism a(page 59).
151
Ru(G)Unipotent radical of an algebraic group G(page 128).
R[x]The polynomial ring in a countably infinite set xof variables (page 10).
SFree abelian group generated by S(page 46).
SmThe symmetric group on msymbols (page 16).
Sat(S)Saturation of a semigroup S(page 46).
SLnGroup of complex n×nmatrices with determinant 1, the special linear
group (page 129).
slnLie algebra of SLn(page 81).
supp(P)Support of a polynomial P(page 13).
SymkWThe k-th symmetric power of the vector space W(page 49).
tr(A)The trace of a square matrix A(page 80).
VA complex vector space, usually V=C[W]d(page 57).
V0Polynomial part of a GLn-module V(page 137).
V(ϱ)Vector space of a representation ϱ(page 129).
V
G(ϱ)Vector space of a G-representation ϱ(page 129).
ωPOrbit map of a point/polynomial P(page 61).
ϖPRational projective orbit map of a point/polynomial P(page 61).
WA complex vector space of dimension n(page 57).
xA countably infinite set of formal variables (page 10).
X(G)Character group of G(page 130).
ˇ
X(G)One-paramter-subgrups of G(page 131).
Z(P)Affine vanishing set of a homogeneous form P(page 63).
152
Index
#P (complexity class):15
1-psg: 132
additive group: 130
adjugate: 81,91
affine cone: 60
algebraic group: 126
Lie algebra: 80
algebraic monoid: 126
algebraic semigroup: 125
morphism: 125
Alon-Tarsi conjecture: 46
annihilator: 61,91,118
approximation order: 79
approximation path: 79
linear: 79
order: 79
arithmetic circuit: 10
arithmetic complexity measure: 35
binary determinantal complexity: 21
binary variable matrix: 21
binomial: 107
blowup: 83,85
smooth: 85
border complexity: 37
Borel subgroup: 130
boundary: see also orbit
categorical quotient: 127
center: 83
character: 132
characterized by its stabilizer: 43
Chow variety: 47,49,67
circuit: 10
complexity: 10
computes: 10
constant-free: 30
size: 10
skew: 30
weakly skew: 17
closed: 65
closure: see also orbit
complete: 16
complexity: see also circuit
complexity measure: 35
compression space: 91
computation gate: see also gate
computes: see also circuit
concise: 58,90,107
constant-free: see also circuit
degeneration: 58
linear: 79
degree: 131
depth: 10
determinant: 16
determinantal border complexity: 37
determinantal complexity: 18
determinantal orbit border
complexity: 37
153
determinantal orbit complexity: 36
digraph: 22
value: 22
weight: 22
dimension datum: 43
domain: 82
dominant weights: 134
endomorphism
image: 63
rank: 59
equivalent: see also polynomial
G-variety: 125
morphism: 125
gap: see also semigroup
gate
computation gate: 10
input gate: 10
general linear group: 38,126
generalized partition: 135,137
inclusion: 139
geometric quotient: 128
good quotient: 127
graph: 83
highest weight vectors: 136
homomorphism: 132
image: see also endomorphism
inclusion: see also generalized partition
input gate: see also gate
invariant: 125
irreducible: see also representation
isotypical component: 136
Kronecker product: 81
latin square: 46
length: see also partition
Lie algebra: see also algebraic group
linear: see also approximation path, see
also degeneration
linear subspace
maximal: 62
semistable: 63
unstable: 63
linearized: 129
linearly equivalent: see also polynomial
maximal: see also linear subspace
module: 131
morphism: see also algebraic semigroup
multiplicative group: 130
nonuniform: 13
NP (complexity class):15
nullcone: 62,128
occurrence obstructions: 45
one-parameter-subgroup: 132
orbit: 39,125
boundary: 61
closure: 39
orbit closure coefficient: 40
orbit coefficients: 42
order: see also approximation path
order of approximation: 79
P(complexity class):12
p-family: 12
p-projection: 14
padded permanent: 39
partial derivative: 58,80
partition: 139
length: 46
path value: 22
path weight: 22
permanent: 16
plethysm: 69
polynomial: 139
equivalent: 35
154
linearly equivalent: 110
polynomial part: 67,139
polynomially bounded: 12
polynomially equivalent: 36
polynomially many: 12
polystable: 60
positive: see also root
power series: 77
projection: 14
radical: 130
rank: see also endomorphism, see also
reductive group
reductive: 57,130
reductive group
rank: 134
reflection: 134
representation: 131
irreducible: 132
root: 133
positive: 133
simple: 134
root lattice: 135
saturated: 46
saturation: 46
semigroup
gap: 46
semisimple: 130
semistable: 62,see also linear subspace,
84,129
simple: see also root
singular subspace: 98
size: see also circuit
skew: see also circuit
smooth: see also blowup
stabilizer: 125
stable: 125
subrepresentation: 132
support: 13
support matrix: 26
symmetric power: 49
toric variety: 49
torus: 130
trace: 81
transpose: 41
uniform: 13
unipotent radical: 130
unit group: 126
universal: 18
universal monomial: 49,67
unstable: 62,see also linear subspace
value: see also digraph
VNP (complexity class):14
VP (complexity class):12
weakly skew: see also circuit
weight: see also digraph, 136
weight lattice: 133
weight spaces: 133
Weyl group: 138
155