# A Mind for Madness

## Handlebodies III

I keep naming my posts “handlebodies”, so I think it is officially time to define what one is. A handlebody is a manifold obtained from $D^m$ by attaching various $\lambda$-handles successively. Thus a general handlebody will look like $D^m\cup D^{\lambda_1}\times D^{m-\lambda_1}\cup \cdots \cup D^{\lambda_n}\times D^{m-\lambda_n}$.

If you’re familiar with how to construct a CW-complex, this is pretty similar. You just inductively attach the handles using smooth maps, and then smooth out the manifold so that at each step we have a legitimate smooth manifold. It may be useful to introduce a notation for this. The first attaching $D^m\cup_{\phi_1} D^{\lambda_1}\times D^{m-\lambda_1}$ with attaching map $\phi_1: \partial D^{\lambda_1}\times D^{m-\lambda_1}\to \partial D^m$ will be denoted $\mathcal{H}(D^m; \phi_1)$. So inductively denote the i-th attaching by $\mathcal{H}(D^m ; \phi_1, \ldots , \phi_{i-1}, \phi_i)$.

After i steps, we will always have i attaching maps even if some are formally meaningless (attaching a 0-handle is a disjoint union, so there is no attaching).

If we express M as a handlebody we call that a handle decomposition of the manifold.

Next time I’ll prove the result that everyone that has been reading the posts will have already guessed. Given a Morse function $f: M\to \mathbb{R}$ on a closed manifold, $f$ determines a handle decomposition of $M$. Moreover the handles of this handlebody correspond to the critical points of $f$, and the indices of the handles coincide with the indices of the corresponding critical points.

I’m short on time today, so I’m going to put off proving it.

I’m not sure how much to say about my other news, since it is still sort of up in the air. I passed two of my qualifying exams (which was all that was necessary for now), and I may officially lock myself into the path of algebraic geometry as my field in the next couple of days. Before I say too much about this, I’ll just say that I should have more information on Monday about what I’m officially doing.

## Handlebodies II

Let’s think back to our example to model our $\lambda$-handle (where $\lambda$ is not a max or min). Well, it was a “saddle point”. So it consisted of a both a downward arc and upward arc. If you got close enough, it would probably look like $D^1\times D^1$.

Well, generally this will fit with our scheme. An n-handle looked like $D^n$ … or better yet $D^n\times 0$, and a 0-handle looked like $0\times D^n$, so maybe it is the case that a $\lambda$-handle looks like $D^\lambda\times D^{n-\lambda}$. Let’s call $D^\lambda\times 0$ the core of the handle, and $D^{n-\lambda}$ the co-core.

By doing the same trick of writing out what our function looks like at a critical point of index $\lambda$ in some small enough neighborhood using the Morse lemma, we could actually prove this, but we’re actually more interested now in how to figure out what happens with $M_t$ as $t$ crosses this point.

By that I mean, it is time to figure out what exactly it is to “attach a $\lambda$-handle” to the manifold.

Suppose as in the last post that $c_i$ is a critical value of index $\lambda$. Then I propose that $M_{c_i+\varepsilon}$ is diffeomorphic to $M_{c_i-\varepsilon}\cup D^\lambda\times D^{m-\lambda}$ (sorry again, recall my manifold is actually m-dimensional with n critical values).

I wish I had a good way of making pictures to get some of the intuition behind this across. I’ll try in words. A 1-handle for a 3-manifold, will be $D^1\times D^2$, i.e. a solid cylinder. So we can think of this as literally a handle that we will bend the cylinder into, and attach those two ends to the existing manifold. This illustration is quite useful in bringing up a concern we should have. Attaching in this manner is going to create “corners” and we want a smooth manifold, so we need to make sure to smooth it out. But we won’t worry about that now, and we’ll just call the smoothed out $M_{c_i-\varepsilon}\cup D^\lambda\times D^{m-\lambda}$, say $M'$.

Let’s use our gradient-like vector field again. Let’s choose $\varepsilon$ small enough so that we are in a coordinate chart centered at $p_i$ such that $f=-x_1^2-\cdots - x_\lambda^2 + x_{\lambda +1}^2+\cdots + x_m^2$ is in standard Morse lemma form.

Let’s see what happens on the core $D^\lambda\times 0$. At the center, it takes the critical value $c_i$ and it decreases everywhere from there (as we move from 0, only the first $\lambda$ coordinates change). This decreasing goes all the way to the boundary where it is $c_i-\varepsilon$. Thus it is the upside down bowl (of dimension $\lambda$). Likewise, the co-core goes from the critical value and increases (as in the right side up bowl) to the boundary of a $m-\lambda$ disk at a value $c_i+\delta$ (where $0<\delta<\varepsilon$).

Let's carefully figure out the attaching procedure now. If we think of our 3-manifold for intuition, we want to attach $D^\lambda\times D^{m-\lambda}$ to $M_{c_i-\varepsilon}$ by pasting $\partial D^\lambda\times D^{m-\lambda}$ along $\partial M_{c_i-\varepsilon}$.

So I haven't talked about attaching procedures in this blog, but basically we want a map $\phi: \partial D^\lambda\times D^{m-\lambda}\to \partial M_{c_i-\varepsilon}$ and then forming the quotient space of the disjoint union under the relation of identifying $p\in \partial D^\lambda\times D^{m-\lambda}$ with $\phi (p)$. Sometimes this is called an adjunction space.

So really $\phi$ is a smooth embedding of a thickened sphere $S^{\lambda - 1}$, since $\partial D^\lambda=S^{\lambda-1}$. And the dimensions in which it was thickened is $m-\lambda$. Think about the "handle" in the 3-dimensional 1-handle case. We gave the two endpoints of line segment (two points = $S^0$) a 2-dimensional thickening by a disk.

Now it is the same old trick to get the diffeo. The gradient-like vector field, $X$, flows from $\partial M'$ to $\partial M_{c_i+\varepsilon}$, so just multiply $X$ by a smooth function that will make $M'$ match $M_{c_i+\varepsilon}$ after some time. This is our diffoemorphism and we are done.

## Handlebodies I

We now come to the main point of all these Morse theory posts. We want to somehow figure out what a closed manifold looks like based a Morse function that it admits (who knows how long I’ll develop this theory, maybe we’ll even get to how Smale proved the Poincare Conjecture in dimensions greater than or equal to 5).

Suppose $M$ is closed and $f:M\to\mathbb{R}$ a Morse function. We’ll use the convenient notation $M_t=\{p\in M : f(p)\leq t\}$. So again, with the height analogy, as t increases, we will be looking at the entire manifold up to that height. Since M is compact, there is some finite interval $[a,b]$ such that $M_a=\emptyset$ and $M_b=M$.

Note that with essentially no modification, we have already proved the Theorem that if $[c,d]$ contains no critical values, then $M_c\cong M_d$. So really, the point is to now figure out what happens as we pass through the critical values.

First off, there are only finitely many critical points, and we can assume that each of these has distinct critical values by raising and lowering critical values. So if $p_0, \ldots, p_n$ are the critical points and $c_k=f(p_k)$, we can order the indices so that $c_0 < c_1 < \cdots < c_n$.

To be explicit, $c_0$ is the min, so $M_t=\emptyset$ for $t < c_0$ and $M_t=M$ for t greater than $c_n$, since $c_n$ is the max (also, wordpress hates inequalities, or me, I haven't decided yet, but it always cuts out lots of stuff and I just have to write the inequality in words).

These two critical points would be a nice place to start our examination. By the Morse lemma and the fact that a min has index 0, we know that there exists a neighborhood of $p_0$ on which $f=x_1^2+\cdots + x_m^2+c_0$ (Alright, I’m sorry about that, but I just realized I have n critical points, so the dimension of my manifold is now m).

More explicitly there is some $\varepsilon>0$ such that $M_{c_0+\varepsilon}=\{(x_1, \ldots , x_m) : x_1^2+\cdots + x_m^2\leq \varepsilon\}\cong B^m$. So if we are thinking of height (of a 2-dim manifold), we’ll want to visualize this as a “bowl” where you have the bottom of the bowl the min and then it slopes upward along a sphere, and then you have the boundary circle at height $c_0+\varepsilon$.

So note that the only thing we used about this critical point is that it had index 0. This shape is called a (m-dimensional) 0-handle.

The reverse happens at our max. We have $M_{c_n-\varepsilon}=\{(x_1, \ldots , x_m) : x_1^2+\cdots +x_m^2\geq \varepsilon\}$, since the critical point has index m. This is an $m$-handle and thinking in 2-d height, it is a downward facing bowl.

Again, there is nothing special about being the absolute max, any index m critical point will locally be an $m$-handle.

Index k critical points where $k\neq 0,m$ are more complicated so I’ll leave those for next time.

Now we have a nice overview of how this will work. We just need to figure out what a $k$-handle looks like, then as t increases through a critical value with index k, $M_t$ will “attach a k-handle”. When we are not near a critical value, the $M_t$ will not change diffeomorphism-type. We just need to make this a little more precise next time (or maybe even the time after).

## Altering the Critical Points

I officially have a new favorite search for which someone found this blog: How to write a Japanese satire.

Let’s introduce a new term. Two Morse functions are considered equivalent if they have the same critical points and same index at each critical point.

The hope here is that two equivalent Morse functions will give the same topological data about our manifold, and so we want to develop techniques of altering our Morse function to something extremely nice to work with, but having it be equivalent to the origin one.

Our first excursion into this technique is the following: If M is a compact manifold and $f$ is a Morse function on M, then we can find an equivalent Morse function $g$ such that all the critical values are distinct.

If we’re going back to the height intuition, this is the technique that corresponds to “raising” or “lowering” critical points. So if you have two strange things happening at the same height (two mountain peaks that have the same height), the idea is sort of that you can slightly move the manifold around so that one is now higher than the other. Of course, we won’t actually move the manifold in any real sense, we’re going to construct the function.

This is going to be really nice, because it says that we can always get a Morse function in which only a single “change” can happen at any given height.

We’ll do this by first proving a Lemma which does all the work for us. Let $f$ be our Morse function, and $p$ a critical point. Then there is some $\varepsilon>0$ such that for all $c\in (-\varepsilon, \varepsilon)$ there is an equivalent Morse function $h$ that has the same critical values as $f$, except for $h(p)=f(p)+c$.

The arguments here are essentially the same as in previous posts, so I’ll be a little looser and only outline the proof.

Since the critical points are isolated we can take a small coordinate chart centered at $p$ that contains no other critical points. Now let $\psi$ be a bump function that is 1 on some small neighborhood of $p$ and dies to zero before getting to the edge of the chart.

Then we define $h_c=f+c\psi$. We definitely have that all the critical points of $f$ are still critical points of $h_c$ and since on a neighborhood of any of those points the functions either agree or differ by adding a constant, they have the same index. Also, $h_c(p)=f(p)+c$, so we have constructed our desired function as long as we don’t have any extra critical points.

But in the same was as before, $\Big|Dh_c\Big|=\Big|Df+cD\psi\Big|\geq \delta-ca>0$ for all $|c|<\varepsilon$ where $\varepsilon=\delta/a$, since we're only concerned with the compact set on which $\psi$ is decaying, $Df$ has a positive min $\delta$, and $D\psi$ has a finite max $a$. Thus we do not gain any critical points in that set and we are done.

To get to the whole theorem all we need to do is note that there are only finitely many critical points (since compact). So if any of the values are shared, we can use the lemma to give an equivalent Morse function with shifted critical value, where we shift by a small enough value that it can't make it to any other critical value. We only have to apply this a finite number of times.

## Gradient-Like Vector Fields Exist

Now we want to start building some technique that will allow us to figure out what our closed manifold looks like based on the Morse functions it admits.

We’ll call a vector field $X$, a gradient-like vector field for f, if $X\cdot f>0$ away from critical points, and if $p\in M$ is a critical point of index $\lambda$, then there is a coordinate neighborhood about $p$ such that f has the standard form as in the Morse lemma, and $X=-2x_1\frac{\partial}{\partial x_1}-\cdots - 2x_{\lambda}\frac{\partial}{\partial x_\lambda}+2x_{\lambda+1}\frac{\partial}{\partial x_{\lambda+1}}+\cdots + 2x_m\frac{\partial}{\partial x_m}$ (i.e. it is the gradient in this neighborhood).

Intuitively, if we think back to our example, we visualize Morse functions as “height functions”. So we are attempting to construct in some sense an everywhere “upward” pointing vector field. If we’re thinking of the entire manifold flowing along this, then the only places where it is allowed to get “stuck” is at the critical points of $f$.

The theorem is that there always exists a gradient-like vector field for a Morse function on a compact manifold.

Proof: As before, let $\{U_i\}_1^k$ be a finite subcover of coordinate charts, and $\{K_i\}_1^k$ be a compact refinement. Since the critical points are isolated (immediate corollary to the Morse lemma), there can only be finitely many since our manifold is compact. So we can assume that each critical point has a neighborhood small enough so that it is entirely contained in exactly one of the $U_i$, and that the $U_i$ were chosen so that $f$ has standard form in those coordinates.

Let $\psi_i: U_i\to \mathbb{R}$ be a bump function for $K_i$ supported in $U_i$. Then we get a smooth function on the entire manifold by letting $\psi_i\equiv 0$ outside of $U_i$.

Let $X_i$ be the gradient of $f$ on $U_i$. Let $\displaystyle X=\sum_{j=1}^k \psi_jX_j$. The claim is that this is our gradient-like vector field for $f$.

Let’s check $X\cdot f$ at non-critical points. If $x\in M$ is not a critical point, and $x\in U_i$, then $(\psi_i X_i\cdot f)(x)>0$ since $X_i$ is the gradient and $\psi_i(x)>0$. All other terms of the sum are 0 since $\psi_i(x)=0$ for any $i$ such that $x\notin U_i$. Thus $(X\cdot f)(x)>0$.

The other condition we have set up to work since each critical point has a neighborhood that is contained in precisely one of the $U_i$, thus on that neighborhood $f$ is in standard form, and $X=\psi_iX_i$ which is of the correct form. Thus $X$ is gradient-like for $f$.

As a preview of things to come, I’ll prove our first result about what our manifold looks like using Morse functions. This is often called the Regular Interval Theorem.

Suppose that $f$ has no critical value in $[a,b]$, then $M_{[a,b]}=\{p\in M : a\leq f(p)\leq b\}$ is diffeomorphic to $f^{-1}(a)\times [0,1]$.

Let $X$ be gradient-like for $f$. Define $\displaystyle Y=\frac{1}{X\cdot f}X$ which is smooth off of the critical points of $f$, but since $M_{[a,b]}$ contains no critical points it is a smooth vector field there (in fact, on an open set containing $M_{[a,b]}$).

Let $\theta^p(t)$ be an integral curve for $Y$ starting at $p\in f^{-1}(a)$. But now $\displaystyle \frac{d}{dt}\Big|_{t=t_0}f(\theta^p(t))=\frac{d\theta^p}{dt}(t_0)(f)$
$\displaystyle = Y_{\theta^p(t_0)}(f)$
$\displaystyle = \frac{1}{X\cdot f}X\cdot f=1$.

Thus, the integral curve continues along at constant speed 1 for the entire time it is in $M_{[a,b]}$. But it starts at $f=a$ at time 0, so it reaches $f=b$ at time $t=b-a$.

Thus $h: f^{-1}(a)\times [0,b-a]\to M_{[a,b]}$ by $(p,t)\mapsto \theta^p(t)$ is a diffeomorphism. But rescaling gives the diffeo to $f^{-1}(a)\times [0,1]$.

This basically says that between critical points of a Morse function, we must have the manifold looking like cylinder built off of a single slice of the function (if we’re thinking in terms of height, we can pick any height, and at anywhere between the two nearest critical heights, all the level sets will look the same).

## Morse Functions Exist

The astute reader at this point may be getting a little anxious that despite the fact that I found Morse function in two easy low dimensional cases, my eventual goal of saying very general things about manifolds by using Morse functions is going to rely on the fact that they exist.

If these thing are really as powerful as I have been making them out to be, then it would seem that there probably isn’t an abundance of them. But surprisingly, it turns out that basically every smooth function is Morse.

Let $M^n$ be a closed manifold, and $g:M\to \mathbb{R}$ be a smooth function. Then there is a Morse function $f:M\to\mathbb{R}$ arbitrarily close to $g$.

Recall Sard’s Theorem (I’m assuming some familiarity with it, which is probably not a good idea): The set of critical values of a smooth map $f: U\to \mathbb{R}^n$ has measure zero in $\mathbb{R}^n$.

Now we’ll first need a lemma. Let $U\subset \mathbb{R}^n$ be an open set and $f:U\to\mathbb{R}$ a smooth function. Then there are real numbers $\{a_k\}$ such that $f(x_1, \ldots, x_n)-(a_1x_1+a_2x_2+\cdots + a_nx_n)$ is a Morse function on $U$. We can also choose $\{a_k\}$ to be arbitrarily small in absolute value.

Let $p\in U$ be a critical point of $f$. Define $h=Jac(f)^T$ (a smooth map $h:U\to\mathbb{R}^n$). Then $Jac(h)\Big|_p$ is the Hessian $H_f(p)$. Thus, p is a critical point of $h$ iff $det(H_f(p))=0$.

By Sard’s Theorem, we can choose $a=(a_1, \ldots , a_n)\in\mathbb{R}^n$ where each $a_k$ have arbitrarily small absolute value such that $a$ is not a critical value of $h$.

The claim is that $\overline{f}(x_1, \ldots , x_n)=f(x_1, \ldots, x_n)-(a_1x_1+\cdots + a_nx_n)$ is a Morse function on U.

Well, if $p$ is a critical point of $\overline{f}$, then since $\frac{\partial \overline{f}}{\partial x_i}\Big|_p=\frac{\partial f}{\partial x_i}\Big|_p - a_i=0$, by the definition of h, we get $h(p)=a$.

But we chose $a$ to not be a critical value of h. Thus, p is not a critical point of h. So as noted, $det(H_f(p))\neq 0$. But $H_f(p)=H_{\overline{f}}(p)$, so $p$ is a non-degenerate critical point. Since p was an arbitrary critical point, all critical points are non-degenerate and hence $\overline{f}$ is Morse, completing the proof of the Lemma.

We also need another Lemma. Let $K\subset M$ be a compact subset. Then if $g:M\to\mathbb{R}$ has no degenerate critical points in $K$, then we can choose $\varepsilon >0$ small enough so that any $C^2$ approximation of $g$ also has no degenerate critical points in $K$.

Since our manifold is closed, it is compact. So we can choose a finite subcover of coordinate charts, and compactly refine it (I’ll do this construction if someone asks in the comments), so that $\{U_i\}_1^m$ cover $M$ and there are compact sets $K_i\subset U_i$ such that $\cup K_i=M$.

But with this, we can look at any of the $U_k$, and in these coordinates, $g$ has no degenerate critical points in $K\cap K_k$ (alright, that was probably a poor choice of notation) iff $\displaystyle\Big|\frac{\partial g}{\partial x_1}\Big|+\cdots + \Big|\frac{\partial g}{\partial x_n}\Big|+\Big| det(H_g)\Big|>0$ for every point in $K\cap K_k$.

But for a small enough $\varepsilon$ we can definitely still make that inequality hold for any $C^2$ approximation. Thus we have proved the lemma.

Now let’s do the actual existence proof. Take the $U_i, K_i$ as before. We will inductively build our $C^2$ approximations on $C_l=K_1\cup \cdots \cup K_l$. Our base step is to build $f_0$ on $C_0=\emptyset$, so we’re done.

For our inductive hypothesis, suppose we have $f_{l-1}:M\to\mathbb{R}$ having no degenerate critical points in $C_{l-1}$.

Let’s work with the coordinate neighborhood $U_l$ with coordinates $(x_i)$. By the first lemma, there are arbitrarily small numbers $\{a_i\}$ so that $f_{l-1}(x_1, \ldots , x_n)-(a_1x_1+\cdots + a_nx_n)$ is Morse on $U_l$. But note, we only have a definition on $U_l$ and we need one everywhere.

Let $\psi$ be a bump function that is 1 on $K_l$ and supported in $V$, where $K_l\subset V\subset U_l$.

Define $f_l=\begin{cases} f_{l-1}-\psi\cdot (a_1x_1+\cdots a_nx_m) & in \ U_l \\ f_{l-1} & outside \ V\end{cases}$.

(So I have this same cases problem again, just ignore the “line break” symbol, it is actually readable this time).

This gives us a nice well-defined function on all of $M$ (just need to check the overlaps). Also $f_l$ is our first lemma function on $K_l$, so it is Morse on $K_l$ and hence has no degenerate critical points there.

Since $0\leq \psi \leq 1$ (and we’re on a compact set), we can make $\{a_i\}$ small enough so that $f_l$ is an arbitrarily close $C^2$ approximation of $f_{l-1}$ (I won’t do this since it is fairly long and tedious, but quite straightforward for the reasons I gave).

But now by the second lemma, since $f_{l-1}$ has no degenerate critical points in $C_{l-1}$, we have that $f_l$ has no degenerate critical points in $C_{l-1}$ either. We already checked on $K_l$, and thus there are no deg. critical points on $C_{l-1}\cup K_l=C_l$.

Thus inductively we can get a Morse function on all of $M$ that is $C^2$-close to our original smooth function.

## The Morse Lemma

Today we prove what is known as The Morse Lemma. It tells us exactly what our Morse function looks like near its critical points.

Let $p\in M$ be a non-degenerate critical point of $f:M\to \mathbb{R}$. Then we can choose coordinates about p, $(x_i)$, such that in these coordinates $f=-x_1^2-x_2^2-\cdots -x_\lambda^2+x_{\lambda+1}^2+\cdots +x_n^2+f(p)$. Moreover, $\lambda$ is the index of the critical point. (Note that $0\mapsto f(p)$).

Proof: Choose local coordinates, $(x_i)$, centered at $p$. Without loss of generality $f(p)=0$ by replacing $f$ with $f-f(p)$. Thus in coordinates, since p corresponds to 0, $f(0)=0$ (it is a little sloppy, but I’ll probably call the actual function and the function in coordinates the same thing and go back and forth).

By a general theorem of multi-variable calculus (I don’t know if it has a name, it might be Taylor’s theorem? I always get confused at how much is actually included in that), we have smooth functions $g_1, \ldots, g_n$ such that $f(x_1, \ldots, x_n)=\sum_{k=1}^n x_ig_i(x_1, \ldots, x_n)$ and $\displaystyle \frac{\partial f}{\partial x_i}\Big|_0=g_i(0)$.

But 0 is a critical point of $f$, so $g_i(0)=0$ and we can apply the theorem again to each $g_i$. We’ll suggestively call the smooth functions $g_k(x_1, \ldots, x_n)=\sum_{i=1}^n x_i h_{ki}(x_1, \ldots, x_n)$.

Thus, we now have $\displaystyle f=\sum_{k,i}x_kx_i h_{ki}$. Let $\displaystyle H_{ki}=\frac{(h_{ki}+h_{ik})}{2}$.

Then $\displaystyle f=\sum_{k, i}x_kx_i H_{ki}$, and $H_{ki}=H_{ik}$.

But in that form we see that the second partial derivatives are $\displaystyle \frac{\partial^2 f}{\partial x_k \partial x_i}\Big|_0=2H_{ki}(0)$.

By assumption $0$ is a non-degenerate critical point, so $det(H_{ki}(0))\neq 0$ and hence we can apply a linear transformation to our current coordinates and get that $\frac{\partial^2 f}{\partial x_1^2}\Big|_0\neq 0$. Thus $H_{11}(0)\neq 0$.

Now $H_{11}$ is continuous, so that means it is non-zero in a neighborhood of 0.

Let $(y_1, x_2, \ldots, x_n)$ be a new coordinate neighborhood where $y_1=\sqrt{|H_{11}|}\left(x_1+\sum_{i=2}^n x_i\frac{H_{1i}}{H_{11}}\right)$. (Note this is actually a coordinate system, since the determinant of the Jacobian of the transformation from this one to the old one is non-zero).

Now $\displaystyle y_1^2=|H_{11}|\left(x_1+\sum_{i=2}^nx_i \frac{H_{1i}}{H_{11}}\right)^2$
$= H_{11}x_1^2 + 2\sum_{i=2} x_1x_i H_{1i} +\left(\sum_{i=2} x_i H_{1i}\right)^2/H_{11}$ if $H_{11}>0$, and the same thing with minus signs everywhere if $H_{11}$ is negative.

Thus the function is $y_1^2+\sum_{i,j=2}x_ix_jH_{ij}-\left(\sum_{i=2} x_i H_{1i}\right)^2/H_{11}$ if $H_{11}>0$ or
$-y_1^2 +\sum_{i,j=2} x_ix_j H_{ij} -\left(\sum_{i=2}x_i H_{1i}\right)^2/H_{11}$ otherwise.

(I awkwardly wrote this with words, because I couldn’t get cases to look right, and was having weird errors I couldn’t figure).

Now just isolate the stuff after the $\pm y_1^2$. It satisfies the same conditions as $f$, but has fewer variables, so we can induct on the number of variables until we have $f(y_1, \ldots , y_n)=-y_1^2-\cdots - y_\lambda^2 +y_{\lambda +1}^2+\cdots +y_n^2$.

And since the plus and minus signs came from changing basis to put the Hessian into diagonal form with plus and minus 1′s, the number of minus signs is indeed the index.

The proof of this tended to be sort of tedious to check everything, so don’t worry if you didn’t go through it. I don’t think there is really insight you get from going through it. This is one of those rare instances that I think the result is more important than the proof.

Now we have real good reason to believe the index will be $n$ or 0 if we are at a local max or min. What does a max or min look like near the point? Well, it slopes all in the same direction, i.e. it will locally look like a sphere. But this is exactly what the Morse lemma tells us about index n and 0 critical points. We’ll make this more precise later.

I wasn’t sure how I was going to proceed. My two options seemed to be to build the Morse theory I need for Lefschetz, and then do Lefschetz, then come back to Morse theory. But I think I’m just going to continue as far as I want to go ignoring what is needed for the Hyperplane Theorem, then reference what I need.