Suggestions for resources? (textbooks, videos, etc)
0) Algebra, Trigonometry, Calculus
Make sure you have a decent grasp over high school level math topics. You might not need to use these topics frequently (though trig comes up a surprising amount), but they are necessary to establish a base level of mathematical maturity.
1) Linear Algebra
Obviously very important if you want to do any 3D work, and it also comes up in later topics like graph theory.
Linear Algebra and Its Applications - Strang
2) Discrete Math
You need an understanding of proofs and logic before you can get to the real algorithms material. Counting and probability are also very important.
Discrete Mathematics and Its Applications - Kenneth Rosen
This looks like a good free option, but there are no exercises:
CLRS, but here's a free option: http://www.cs.berkeley.edu/~vazirani/algorithms.html
This should take you some time as each of these topics usually corresponds to a college class. When you finish with those, start reading about CS Theory, Combinatorics, or pick up a graduate Algorithms text.
There's a cute idea to put 100 wireless customers all on the same wavelength. Basically, have a tricky antenna pattern with a lobe for each user, so that each user gets only their own signal. It's all trig.
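To make the "it's all trig" point concrete, here is a minimal sketch of the array factor of a uniform phased array steered toward one user's angle. Everything here (element count, half-wavelength spacing, function name) is an illustrative assumption, not the actual system described above:

```python
import math

def array_factor(theta, theta0, n=8):
    # n-element array, half-wavelength spacing: each element adds a phase
    # shift of pi * (sin(theta) - sin(theta0)); the phasors add in phase
    # only at the steered angle theta0, forming a lobe there.
    psi = math.pi * (math.sin(theta) - math.sin(theta0))
    re = sum(math.cos(k * psi) for k in range(n))
    im = sum(math.sin(k * psi) for k in range(n))
    return math.hypot(re, im) / n  # normalized gain in [0, 1]

print(array_factor(0.3, 0.3))  # 1.0: full gain at the steered angle
print(array_factor(0.9, 0.3))  # much smaller off the main lobe
```

Summing cosines and sines of multiples of one angle is exactly the trig the commenter has in mind.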
Yes, all or nearly all! That the trig functions are the source of the most important orthogonal basis in Hilbert space theory is worth knowing for far more than just physics or signal processing!
Let's see: Also I mentioned characteristic functions and Bochner's theorem which are core probability and not just physics or signal processing. Characterization of time invariant linear systems might be in mechanical engineering and might be tried even in economics! Seismic data analysis via the fast Fourier transform is, yes, in signal processing but is also mostly regarded as geology or just looking for oil or anything 'down there'.
I tried, as you can see, I really tried, to show how trig was for more than just your father's topics in physics and signal processing. I tried!
1. Reading the "aha!" books by Martin Gardner, as a child.
2. Reading Lockhart's "A Mathematician's Lament" 
3. Linear algebra, calculus, and complex analysis classes. I was taught
these at Cornell; you might look for them via MIT's OpenCourseWare.
4. A bunch of combinatorics, from Cornell's classes in probability.
You may also want to start in a completely different direction, since you're interested more in CS-based topics (I am interested more in physics-based topics): begin with number theory and modular arithmetic. MIT's Comp Sci department also produces lectures on OpenCourseWare; you may wish to watch those.
I haven't found many other books that explain calculus as well as that one. If you get that and then a book of exercises from Schaum's outlines, you'll be able to get pretty good at it. Also, the book is a really good read for a math book.
It takes you through all the relevant mathematics required to learn the basics of the CS topics you are looking for (algorithms, proofs, and other practical applications).
It also acts as a good primer for learning basic computer algorithms. The USP of the book is that it's noob-friendly, and I think it's perfectly suited to your needs.
Check it out here: http://www.amazon.com/Think-About-Algorithms-Jeff-Edmonds/dp...
If you have some prior exposure to the SKI combinators, one big aha-moment may come from the book's exploration of several alternative axiom systems that, while preceding the SK system chronologically, are often left out of modern CS-oriented treatments of combinatory logic.
Also, while I think the exercises on symbol manipulation are not very enlightening on their own, the syntactic rewriting point of view is a useful complement if your view of combinatory logic has been mostly semantic. Proving completeness in the syntactic framework becomes a matter of showing that any combinatory rewriting rule can be expressed in terms of your base combinators. This in turn can be done if your combinators allow arbitrary permutations to be expressed in addition to duplication and dropping of symbols (cf. lengthening and shortening rules in GEB). For example, K is a dropping combinator, and rather than viewing S as a substitution operator on the semantic level (as is done when translating lambda calculus to SK calculus), it can be viewed as a hybrid permutation and duplication combinator at the syntactic level. If these dual roles are split into separate combinators, you get alternative axiom systems.
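The syntactic rewriting point of view above is mechanical enough to sketch as a tiny rewriting engine. This is a minimal, hypothetical Python sketch (the term representation and function names are my own): terms are the atoms 'S' and 'K' or nested application tuples, reduced leftmost-outermost by the rules K x y -> x and S x y z -> x z (y z):

```python
def app(f, x):
    return ('app', f, x)

def spine(t):
    """Unwind the application spine: returns (head atom, [arguments])."""
    args = []
    while isinstance(t, tuple):
        _, f, x = t
        args.append(x)
        t = f
    return t, list(reversed(args))

def step(t):
    """One leftmost-outermost reduction step, or None if in normal form."""
    head, args = spine(t)
    if head == 'K' and len(args) >= 2:
        new, rest = args[0], args[2:]                   # K x y -> x
    elif head == 'S' and len(args) >= 3:
        x, y, z = args[0], args[1], args[2]
        new, rest = app(app(x, z), app(y, z)), args[3:]  # S x y z -> x z (y z)
    else:
        for i, a in enumerate(args):                    # otherwise reduce inside an argument
            r = step(a)
            if r is not None:
                t2 = head
                for a2 in args[:i] + [r] + args[i + 1:]:
                    t2 = app(t2, a2)
                return t2
        return None
    for a in rest:
        new = app(new, a)
    return new

def normalize(t, limit=100):
    for _ in range(limit):
        r = step(t)
        if r is None:
            return t
        t = r
    return t

# the classic derived identity combinator: I = S K K
I = app(app('S', 'K'), 'K')
print(normalize(app(I, 'K')))  # K
```

Playing with a toy like this makes the "any rewriting rule expressible from the base combinators" claim feel tangible: you can add or remove base combinators and see what still normalizes.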
There's some really great stuff listed there, I use it all the time.
1. The Art and Craft of Problem Solving by Paul Zeitz.
2. How to Solve It by George Polya.
I'd also recommend learning any of the competition Mathematics topics like number theory, combinatorics or geometry as base material to improve your problem solving skills. I have also found the Schaum's series on Combinatorics and Graph Theory concise and useful.
More generally, if you're not already comfortable with linear algebra (a couple college semesters "or equivalent experience"), I'd recommend both  and  for two entirely different perspectives. For modern algebra more generally, I'm a huge fan of .
If I could only take one introductory mathematics book to a desert island, I'd cheat a bit and take , , and . While never directly involved in CS, Courant was very interested in the practical applications of theoretical mathematics, see, e.g.,  and, well, most everything else he wrote.
I could go on and on, so I'll stop here.
It then follows something similar to a university-level curriculum in Europe (continental, perhaps not UK), which might lead all the way to research maths, but perhaps it is not the most appropriate path for computer science because it entirely misses the mathematical concept of computability.
Computer Science: http://aduni.org
Another vote for khanacademy. I did this myself, and worked out a plan to put every piece of information into long term memory. When you're motivated and not limited by century-old university bureaucracy, you can move very quickly, and run circles around people with a "real" degree, most of whom have forgotten everything but the general concepts.
Udacity is also coming up as an excellent resource, but the important thing to realize is that every resource adds a different perspective on the matter that is extremely valuable.
For CS-specific information, aduni.org is a glorious resource. It's from 2001, but the CS basics never change. I'd recommend getting a handle on all the math from khanacademy so that you understand everything they're referencing.
Good place to start would be to go through khan academy's probability videos.
Then you can move to a discrete math book like Discrete Mathematics with Applications or Mathematics for Computer Science (free pdf courses.csail.mit.edu/6.042/fall10/mcs-ftl.pdf) and any Calculus book of your liking.
There are also great resources to learn Math using Haskell. I'm working through The Haskell Road to Logic, Maths and Programming and it's proving to be great http://www.haskell.org/haskellwiki/Books_and_tutorials/Mathe...
I have also signed up for a class at the nearest community college in my area.
Udacity's introduction to statistics, so far, has been really easy.
From what I've seen so far, Udacity are aiming to be more inclusive "community college" level and Coursera are aiming for a true university level education.
I'm having a really tough time with the exercises even though I should have the prereqs... What can I say, we don't do much rigorous math in engineering.
Anyways this book came highly recommended to me from a friend in quantitative finance.
The benefit of DIY maths is that you get to focus on what your interest is, whether that be algebra or geometry or stats/data analysis or discrete maths or foundations or whatever. (Contrary to some answers here, I don't think trig, measure theory, or planar geometry are necessary, unless you read something and it catches your fancy.) If your goal is just to do more maths, then just throw away anything that bores you and dive into anything that interests you.
If you're interested in C.S. applications then I think discrete maths is the thing to focus on. Eg, "Concrete Mathematics" by Knuth. But you can go an even more direct route and if algorithms is your goal, there are some free courses available online that do just that.
MIT OCW and AcademicEarth are excellent for in-depth (course-length) treatments of material by seasoned teachers. Wikipedia and math.StackExchange are good for broad overviews and quick Q's, respectively.
For "proofs, and other practical applications" -- that's quite broad. OCW has a course on geophysics and a course on signal processing -- both are "practical applications" but I don't know if that's what you have in mind. Most of the OCW stuff -- whether it's chemistry, materials sci, DSP, whatever -- is aimed at engineers (other than the upper level pure maths), and therefore practical.
As for proofs, you will see them in almost any maths textbook. Spivak makes some challenging ones and if you haven't taken calculus or linear algebra then Flanigan & Kazdan might be worth a skim. But if you're interested in CS applications then you can find those kinds of argumentation in an algorithms course.
BTW, I do not recommend the Khan Academy. It gets a lot of press but I have not found it so great. (It is fine but so is just surfing Youtube -- a lot of people make videos besides SK.)
Also, if you can still edit this question, you might give us some idea of where you're starting from and how much time you expect to spend, to elicit better answers.
PS By "the hard way" do you mean doing exercises? Because pretty much all mathematics texts expect you to do it "the hard way" if that's what you mean.
Use something like Khan Academy to fill in the blanks.
It is squarely targeted at beginners, if you know the stuff already then many other commenters have recommended more advanced books.
Then I started applying it to programming step by step, which made it possible for me to look at every little calculation.
Then I made a tiny physics simulation: http://www.youtube.com/watch?v=ud8NirjyLAA
Here is a nutshell description:
The standard high school level subjects are algebra, plane geometry, second year algebra, trigonometry, and solid geometry.
The standard college level subjects are calculus, abstract algebra, linear algebra, advanced calculus, ordinary differential equations. Might also take elementary courses in probability and statistics.
Standard graduate school topics are point set topology, measure theory, functional analysis.
For more in applications, can do (1) optimization -- linear, network linear, linear integer, quadratic, non-linear, and dynamic programming -- and (2) probability, statistics, and stochastic processes based on measure theory.
Here is an overview:
The course in abstract algebra will make you familiar with a long list of topics that come up in computing including sets, Boolean algebra, logic, relations, mappings, integers, prime numbers, modular arithmetic, rational numbers, the fundamental theorem of arithmetic, the fundamental theorem of algebra, elementary number theory, and the Euclidean greatest common divisor algorithm. You will also see finite field theory which has long been important in algebraic coding theory. You may touch on the question of P versus NP. You may see some of the work of Goedel, etc. and model theory. You will get good with math based on definitions, theorems, and proofs and develop 'mathematical maturity', i.e., ability to read, understand, and do abstract mathematics. Your ability to describe logical material in writing will improve.
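Of the topics in that list, the Euclidean greatest common divisor algorithm is the one that fits in a few lines; a standard sketch:

```python
def gcd(a, b):
    # Euclid: gcd(a, b) = gcd(b, a mod b), and gcd(a, 0) = a
    while b:
        a, b = b, a % b
    return a

print(gcd(252, 105))  # 21
```

It is also the usual first example of proving an algorithm correct and bounding its running time, so it pairs naturally with the "definitions, theorems, and proofs" style of an abstract algebra course.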
Famous authors in abstract algebra include Birkhoff, MacLane, Herstein, Lang. More recent authors likely touch on algebraic geometry.
Linear algebra, mostly about 'vectors' and linear transformations, is the core of an ocean of applications in multivariate statistics (regression, factor analysis, analysis of variance, discriminant analysis, and more, and, thus, applications in ad targeting, machine learning, data mining, anomaly detection, recommendation engines, etc.), optimization, group representation theory as in molecular spectroscopy, Shannon's information theory, and more. E.g., linear algebra is the place to learn about solving systems of linear equations, e.g., by Gauss elimination, and, thus, the place to get started with matrix inversion and the simplex algorithm of linear programming (optimization) and network linear programming.
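Gauss elimination itself is short enough to sketch. This is a bare-bones illustrative version with partial pivoting (pure Python, no numpy; a real code would also guard against singular matrices):

```python
def solve(A, b):
    """Solve A x = b by Gauss elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]  # augmented matrix [A | b]
    for col in range(n):
        # partial pivoting: bring the largest remaining pivot to the top
        p = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[p] = M[p], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]  # eliminate below the pivot
    # back substitution on the resulting upper-triangular system
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

print(solve([[2.0, 1.0], [1.0, 3.0]], [3.0, 5.0]))  # [0.8, 1.4]
```

For serious numerical work see Forsythe and Moler, as recommended below; this is only the textbook scheme.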
A good first text in linear algebra is by Ben Noble. The crown jewel is by Paul Halmos, 'Finite Dimensional Vector Spaces', written when Halmos was an assistant to von Neumann; the book approaches linear algebra much like it was functional analysis and, thus, is a baby version of Hilbert space theory and, thus, sometimes used to teach Hilbert space theory to physicists for quantum mechanics. Richard Bellman wrote tons. See also Roger Horn's books. Evar Nering's is a good first book (although his treatment of linear programming is not good); so is Hoffman and Kunze (apparently now available on the Internet in PDF for free). For numerical linear algebra, see Forsythe and Moler.
The standard advanced calculus text to teach the theorems and proofs of calculus is Walter Rudin's 'Principles of Mathematical Analysis'. After that book, notation such as O( n ln(n) ) as in Knuth's TACP will be child's play.
Note: At one time the Halmos 'Finite Dimensional Vector Spaces' and Rudin's 'Principles' were used in Harvard's Math 55 with a colorful description in
Also, good coverage of the binomial theorem (from high school or abstract algebra) and an elementary course in probability will be a help in following Knuth's TACP.
Measure theory will take a huge load off your back: Calculus, that is, Riemann integration theory as taught through Rudin's 'Principles', has some rough edges, understood back to at least 1900, and Lebesgue and Borel and measure theory (heavily due to Lebesgue) provide a clean solution. As from Kolmogorov, the solution also makes a clean foundation for probability, stochastic processes, and statistics.
Likely the nicest first text in measure theory is Royden's 'Real Analysis'. Also good is the first, real, half of Rudin's 'Real and Complex Analysis' where he also gives nice introductions to both Hilbert and Banach spaces and a nice treatment of the Fourier transform.
My favorite authors in probability based on measure theory are J. Neveu (long in Paris) and L. Breiman (long at Berkeley). Or just read from their teacher, M. Loeve (long at Berkeley but with a writing style that looks like some cross between English and French).
This background will also let you do much more in both pure and applied math in many directions and for many applications.
For learning math, it is "not a spectator sport". Most of the work is between your ears as you think about the material. A good teacher in abstract algebra is likely necessary to get you started. Also without at least occasional contact with a solid university department it is too easy to get off track. Still, nearly all the work is to be done alone in a quiet room, and there a book is fine. In principle videos could help, but so far I've never seen even one that I would recommend for any utility at all in learning math.
You are right that you do need algebra, linear algebra, just enough calculus to understand infinite series (which is usually put in the second calculus course), etc.
You left out graph theory and combinatorics, both of which are extremely important to CS.
I would also suggest more emphasis on logic, sets, and type theory. Category Theory is also something fairly common in CS. So I would recommend (Free!): www.cs.unibo.it/~asperti/PAPERS/book.pdf
Abstract algebra: http://abstract.ups.edu/
And while I am skeptical about the need for linear algebra in CS, it is such a key requirement for mathematical maturity that I will suggest: http://www.math.miami.edu/~ec/book/. If linear algebra is a building, then abstract algebra is the frame. The two should really be taught at the same time.
Linear algebra is a necessary piece of background for linear programming (including the simplex method) and or standard approximation algorithms for many NP-hard problems.
Strassen's algorithm for fast matrix multiplication is commonly taught in algorithms class. It does not make much sense unless you know what matrix multiplication is.
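For the curious, the heart of Strassen's trick is that a 2x2 block product needs only 7 multiplications instead of 8 (the recursion then applies this to matrix blocks). A sketch of just the 2x2 base case, with my own variable names:

```python
def strassen_2x2(A, B):
    """Multiply 2x2 matrices using Strassen's 7 products instead of 8."""
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    p1 = a * (f - h)
    p2 = (a + b) * h
    p3 = (c + d) * e
    p4 = d * (g - e)
    p5 = (a + d) * (e + h)
    p6 = (b - d) * (g + h)
    p7 = (a - c) * (e + f)
    # recombine the 7 products into the 4 entries of A @ B
    return [[p5 + p4 - p2 + p6, p1 + p2],
            [p3 + p4, p1 + p5 - p3 - p7]]

print(strassen_2x2([[1, 2], [3, 4]], [[5, 6], [7, 8]]))  # [[19, 22], [43, 50]]
```

Saving one multiplication per level of recursion is what drops the exponent from 3 to log2(7) ≈ 2.81.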
Also if you want to work in computer graphics (which comes up both in theoretical and applied problems) you will need a solid understanding of linear algebra and matrices to understand the material.
I could list more, but that's enough to demonstrate that linear algebra does come up in a lot of places.
(p.s. i read your stuff on kelly criterion a long whiles ago, top notch, thanks!)
You're welcome, and thanks for the compliment. :-)
Right. The only advanced calculus text I listed was 'Baby Rudin', and the main contribution there is just to get some of the more important properties of the real numbers, Euclidean n-space, infinite sequences and series, and Riemann integration solid.
If a CS student is to stop before these topics, okay, but if they are going to go on then these topics will be part of what is generally assumed.
For CS, as we know, it is doing what EE did -- moving beyond its core tools and into what to do with those tools. E.g., EE got into nonlinear filtering and stochastic integration. E.g., CS is now getting into both optimization and statistics. Then being handy with that Baby Rudin material, at least the early chapters, will get to be important.
With Baby Rudin already done, there can be a good course in differential equations, and such a course can be a good way to see some of the value of what you did in linear algebra and Baby Rudin and to exercise that material. At some point in the future, a CS guy might well get into some work involving differential equations -- viral growth models, flight of airplanes and space vehicles, and much of mechanical engineering.
One of the main themes in the future of computer applications is handling 'randomness', and my view is that serious work in that direction should have the measure theory foundations. I did an A/B on that! Early in my career I tried the easy way. After measure theory, Neveu, Breiman, Loeve, Dynkin, Lipster, Shiryaev, etc., I concluded that the measure theory approach to probability, stochastic processes, and statistics was essential.
In particular, without measure theory, people too easily get totally stuck in the mud on what 'random' means, while with measure theory Kolmogorov has a really nice answer. My view is that people may not like Kolmogorov's answer but that, with some really simple assumptions, we get forced into that answer anyway!
For the rest, the Hilbert and Banach spaces won't go away! E.g., a huge dessert buffet of really finger lick'n good applications of the Hahn-Banach theorem is David G. Luenberger, 'Optimization by Vector Space Methods', and that book will be a nice source of methods for a lot that computing people might encounter.
> You left out graph theory and combinatorics, both of which are extremely important to CS.
For combinatorics, I assumed that one would get enough, in some ways deeper than in Knuth's TACP, from abstract algebra and elementary probability. E.g., combinatorics is a lot about counting, and so is group theory in abstract algebra.
For graph theory, I assumed that one would get enough from optimization on networks. E.g., a 'basis' in the network simplex method on a network is a minimum spanning tree! And the max flow/min cut theorem can follow just from linear programming. Dynamic programming, which I mentioned, can be viewed as graph theory.
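Since minimum spanning trees just came up: Kruskal's MST algorithm is a compact example of the graph-theory-meets-optimization material being discussed (toy graph, union-find with path compression; variable names are my own):

```python
def kruskal(n, edges):
    """Minimum spanning tree of an n-node graph; edges are (weight, u, v)."""
    parent = list(range(n))
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x
    mst = []
    for w, u, v in sorted(edges):  # greedily take cheapest edges first
        ru, rv = find(u), find(v)
        if ru != rv:               # keep the edge only if it joins two components
            parent[ru] = rv
            mst.append((u, v, w))
    return mst

edges = [(1, 0, 1), (4, 0, 2), (2, 1, 2), (3, 1, 3), (5, 2, 3)]
print(kruskal(4, edges))  # three edges spanning the 4 nodes, total weight 6
```

The correctness proof (the greedy choice is always safe across a cut) is exactly the kind of argument an algorithms or optimization course exercises.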
I did mention linear programming (and so does CLRS), and one reason is that in the CS study of algorithms the algorithms for linear programming are important, and surprising, benchmarks. Also integer linear programming is one of the more important motivations for the question of P versus NP.
For some of the surprise, the simplex algorithm has low-degree polynomial expected performance (K. Borgwardt) but exponential worst-case performance (Klee and Minty), while the polynomial algorithms (Khachiyan) are too slow for practice even in the cases where they beat simplex! So, at one of the first places we looked at computational complexity for problems more challenging than, say, heap sort and AVL trees, an exponential algorithm turned out to be superior in nearly every sense of interest to a polynomial algorithm! There never was a guarantee that the study of computational complexity would be easy!
But there's a lot of overlap: I didn't mention courses in 'finite mathematics', combinatorics, or graph theory, but one way and another what I described should provide enough coverage. I didn't mention the CS book CLRS, but I mentioned good coverage of some of the more advanced topics in that book. Or, there is a lot of blending old wine and pouring it into new bottles.
There is a broad point: A major theme in CS now is to borrow, modify, and apply work done some years ago in applied math, especially from operations research, e.g., linear programming, flows on networks, stochastic point processes, and statistics. While CS has some new applications, typically the material is done more carefully in the old applied math sources. So, I emphasized learning the material as math instead of as CS. Besides, the OP was asking about math for CS and not 'mathematical CS'! Also the title asked for the "hard way"!
There's another broad point: What to learn and why to learn it? Just a first cut view of what O( n ln(n) ) means should take only a little searching on the Internet. Besides, Knuth's TACP is quite clear on such 'asymptotics'. I wish Microsoft's MSDN documentation of .NET was as clear and easy to read as TACP!
One reason not to learn this stuff is to confirm that Knuth, Sedgewick, CLRS, etc. were correct after all!
Generally the reason to learn such stuff is for new applications in the future. For that, my view was just to stay with a relatively traditional course in relatively applied math although I leaned away from classic mathematical physics and more to business applications.
I think there is a point of diminishing returns.
Baby Rudin I'm dubious about. But Royden and big Rudin (both of which you recommended) I have certainty about. There are good reasons that I never saw CS students in my real analysis classes. I don't think it is particularly valuable for CS, either then or now, to acquire a deep understanding of real analysis.
And yes, I know about measure theory. I know how it applies to probability. But I went the other way. I learned measure theory. Then I learned probability. Then I began having to do probability stuff in the real world. And not once has my measure theory background been particularly relevant.
As for Hilbert and Banach spaces, they are key pieces of mathematics. In fields from wavelets to optimization theory, they come up over and over again. But I would wager that most computer science professors do not need to know what Hilbert and Banach spaces are. I'd even bet that most have not heard of the Hahn-Banach theorem. Again, if you find yourself going that way, learn it later.
On combinatorics and graph theory, you claim that people will learn enough of that material elsewhere. Maybe, maybe not. But it is clear that programming problems routinely get turned into graph theory problems, many of the most important programming algorithms are about graph theory (start with the traveling salesman problem and work your way through the list of NP-complete problems), and at its heart, analyzing an algorithm's run-time is a combinatorics problem. Acquiring the necessary concepts and vocabulary for those is necessary, whether you classify the book you're learning from as a math text or a CS text.
Yes, there is a big question about what to learn, about how much to invest in such things.
> Baby Rudin I'm dubious about. But Royden and big Rudin (both of which you recommended) I have certainty about.
But Baby Rudin is a prerequisite to Royden and big Rudin.
I'm sorry, but probability, stochastic processes, and mathematical statistics were junk for me until I went at them via measure theory.
I floundered terribly with random variables until I saw the measure theory definition; it's terrific: Go take 10,000 measurements. Now you have the values of 10,000 random variables. Any 10,000 measurements at all. So far, no concept of 'randomness' at all. So, random variables are very general things and, e.g., handle even deterministic processes as a special case.
E.g., sufficient statistics is just an application of the Radon-Nikodym theorem, and a total train wreck to do otherwise. Yes, order statistics are always sufficient, maybe nice to know in 'data mining'. That sample mean and sample variance are sufficient in the Gaussian case is mind blowing; nice opportunity for 'data compression'!
E.g., measure theory and the Radon-Nikodym theorem define conditional expectation, that is, under mild assumptions, E[Y|X] = f(X) for some measurable f. Then easily f(X) is the best non-linear least squares approximation of Y. Nice.
Further, if 'cross tabulate' Y on X, then have a discrete approximation to E[Y|X] which shows that cross tabulation is a discrete version of the best non-linear approximation of Y given X.
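The cross-tabulation point is easy to see numerically. A toy sketch (the model Y = X^2 + noise and all names are illustrative assumptions): bin X, average Y within each bin, and the bin averages track E[Y|X]:

```python
import random

random.seed(0)
data = []
for _ in range(10000):
    x = random.uniform(0, 1)
    y = x * x + random.gauss(0, 0.05)  # toy model: E[Y|X] = X^2
    data.append((x, y))

bins = 10
sums = [0.0] * bins
counts = [0] * bins
for x, y in data:
    b = min(int(x * bins), bins - 1)   # which bin x falls in
    sums[b] += y
    counts[b] += 1

# average Y per bin: a discrete approximation to E[Y|X]
cond_mean = [s / c for s, c in zip(sums, counts)]
for b, m in enumerate(cond_mean):
    center = (b + 0.5) / bins
    print(f"bin {b}: avg Y = {m:.3f}, center^2 = {center * center:.3f}")
```

Each printed bin average sits close to the square of the bin center, i.e., close to the best (non-linear) predictor of Y from X, just as claimed above.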
Measure theory permits working with all the forms of convergence of random variables, especially strong convergence, at least awkward to do otherwise.
Then martingale theory makes little sense without measure theory.
Measure theory, the Kolmogorov extension, shows that we really can have a collection of random variables with desired properties.
Measure theory is crucial in even defining E[Y|U(t), t <= a] since we are conditioning on an uncountably infinite collection of random variables. Measure theory, then, shows that we can replace U(t), t <= a by the sigma algebra they generate and, then, condition on the sigma algebra. Cute.
Constructions such as
E[Y|U(t), t <= a]
are crucial in the nice qualitative, axiomatic definition of the Poisson process.
Similarly for independence of two collections of random variables where each collection has uncountably infinitely many random variables.
Measure theory was crucial in the standard results of ergodic theory.
I wrote a paper on anomaly detection in server farms and networks, and the key idea in the paper was a finite group of measure preserving transformations lifted roughly from ergodic theory.
Via measure theory we can show that the space of real valued L^2 random variables is complete, and, thus, a Hilbert space, which continues to blow my mind that any such thing could be true. I'd also like to have locally compact, but that's a bit much to hope for!
The Doob decomposition shows that every stochastic process is the sum of a martingale and a predictable process, all measure theory!
It's tough enough to believe in probability with the measure theory foundations; otherwise, I couldn't swallow the stuff!
There is a broad point: Maybe OP wants to know what to learn for the applications of the future. Then what current CS profs know is not necessarily very relevant!
That does not tell me that this will be the future of CS. But there are niches where this stuff is more applicable than I had realized.
But one direction is handling 'randomness', and for that I recommend a measure theory foundation of probability, stochastic processes, and mathematical statistics. Just a recommendation. It's risky; your mileage may vary. As Yoda said, "Always difficult to see, the future".
One thing I'd like to point out is that measure theory is not the only and probably the least interesting way to study probability. There is the more elegant (IMO) approach via Nonstandard Analysis. And the fun more practical approach via Game Theory and Markets (which also support "imprecise probabilities").
I also think there's room for different approaches to the same thing, each offering its own unique insight. Many differential equation modelling problems, especially those involving populations, could be fruitfully replaced by agent modelling.
Vovk and Shafer's game-theoretic approach is interesting in that related approaches like bandit models and online learning have recently picked up in popularity.
And real understanding takes time no matter what you do, the best one can do is start off in a manner such that the tools required to understand well enough are not more complicated than the subject matter itself.
However you've created difficulties for the day that they dive back in and try to really learn the subject. Because as they go to learn what a real number really is, and what its properties are, and about pathological functions, they also have to learn about the hyper-reals and a complex model-theory construction that (in both variations that I am aware of) requires choice.
There are a variety of other pedagogical choices that do not present such barriers to comprehension. (Note, by no means am I a fan of the limit approach. I know full well that it goes over the heads of the students, and I see no point in having the person at the front blather on about stuff that the class cannot be expected to understand.)
In any case my biggest complaint about how Calculus is taught is this. I think people come out of a first Calculus course without understanding the tangent line properly. If you don't understand the tangent line, Calculus is a mass of formulas. Despite how easy it is symbolically to jump directly from the tangent line to the derivative, I think that a solid week should be spent on the tangent line (calculating it for more complicated functions, finding applications, etc.) until every student understands it well enough that they are ready to take the leap of looking at the slope and building a function out of it.
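The tangent-line-first idea can even be demonstrated numerically before any symbolic rules. A small sketch (my own choice of function and point): watch secant slopes through (a, f(a)) settle down to the tangent slope as the second point slides in:

```python
def f(x):
    return x ** 3  # example curve; tangent slope at a=1 is 3

a = 1.0
for h in [0.1, 0.01, 0.001, 0.0001]:
    slope = (f(a + h) - f(a)) / h  # slope of the secant through a and a+h
    print(f"h = {h}: secant slope = {slope:.4f}")
# the tangent line at a is then y = f(a) + 3 * (x - a)
```

Seeing the slopes converge makes "the tangent line is the limit of secant lines" an observation rather than a definition to memorize.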
Anyone that can follow the proofs required to construct the reals can easily do so for the hyperreals. I am not an expert in the area and it's been years since I studied it in depth, but I know there are methods which avoid the need for model theory. I do not recall them invoking the axiom of choice but I could be wrong on that.
But the point is not that the learner gets a Hyperreal-only approach but a varied exposure. I think that is the key, much more important than understanding tangents (what happens to visualization in dimension 4?). I think that learning all of the disparate parts at once and letting the student have the time to become comfortable (linear maps, derivatives, surfaces, groups) is what would be best. Allowing them to drift and backtrack and then whenever they felt ready would take whatever appropriate exams to show mastery of each area. The exams would also allow for more interesting problems.
It would take longer but the end product would be a far more cohesive understanding than the mishmashed nature of the current historical siloed approach. You've done machine learning, right? I think one can take something from there about learning. The brain is just a much more advanced version of those basic objects: more and varied examples is better than curated and small examples. It won't overwhelm anyone unless they're overly impatient, and in that case math is probably not for that personality type.
The proofs for the hyperreals involve a lot more machinery than the proofs for the standard reals. That is my educated opinion based on knowing both sets of proofs and constructions. (Of course this need not make infinitesmals a pedagogical disaster - very few students actually care much about learning the proofs.)
As for the model theory approach, I am intimately familiar with the ultrafilter construction. It uses choice. I know there is a second construction which I am not familiar with, but from what I've read it also requires choice. Both involve model theory. That's a mighty big sledgehammer for a pretty small fly.
Incidentally, Dedekind cuts can be understood as follows. The set of reals can be equated with the set of points where you can cut the rationals into two. More precisely, if X is a nonempty subset of the rationals with an upper bound, we get a cut of the rationals into the set of upper bounds of X and the set of rationals that are not upper bounds of X. Any two subsets can be considered equivalent if their sets of upper bounds are identical. An equivalence class of subsets is a real number.
For any cut you can generate a unique set A of rationals that are not upper bounds, and a set B of rationals that are upper bounds. When you do this, every rational in A is less than every rational in B, and A does not contain a greatest element.
Conversely, if we have a partition of the rationals into two nonempty sets A and B such that every member of A is less than every member of B, and A does not contain a greatest element, then we have a cut of the rationals.
So there is a one-to-one correspondence between reals, places where we can cut the rationals, and partitions of the rationals with that property.
Those partitions of the rationals are called Dedekind cuts.
(This is one of two constructions of the real numbers. The other, Cauchy sequences, turns out to generalize more usefully in the field of topology.)
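The cut description above can be sketched numerically. Here is a minimal illustration (my own toy code, not a construction of the reals): the cut for sqrt(2) is modeled as a predicate picking out the lower set A, and the two defining properties are checked on sample rationals.

```python
from fractions import Fraction

# Model the lower set A of the cut for sqrt(2): every rational q with
# q < 0 or q^2 < 2. (The name in_lower_set is hypothetical, for illustration.)
def in_lower_set(q: Fraction) -> bool:
    return q < 0 or q * q < 2

# Property 1: every member of A is less than every member of B.
samples = [Fraction(n, d) for n in range(-4, 5) for d in range(1, 5)]
A = [q for q in samples if in_lower_set(q)]
B = [q for q in samples if not in_lower_set(q)]
assert all(a < b for a in A for b in B)

# Property 2: A contains no greatest element -- given q in A we can always
# find a larger rational still in A by halving a step until it fits.
def larger_element(q: Fraction) -> Fraction:
    step = Fraction(1)
    while not in_lower_set(q + step):
        step /= 2
    return q + step

q = Fraction(1)
for _ in range(10):
    q = larger_element(q)
assert in_lower_set(q)
print(float(q))  # a rational creeping up toward sqrt(2), never reaching it
```

The point is just that a real number here is nothing more than a way of splitting the rationals in two; the irrational "value" lives in the gap between A and B.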
I did find out that there is a constructive approach, though, so the axiom of choice is not actually necessary: www.math.ucla.edu/~asl/bsl/0403/0403-001.ps
MIT 18.06, 18.085, 18.086, 6.262, 6.450
Stanford EE263, EE364A, EE261
Profs. Strang, Boyd, and Gallager are quite a bit better with math than the typical engineering lecturer, even though their courses are not exactly at the level of Rudin, Breiman, et al.
It's not comprehensive by any means, and you probably need to know at least calculus to be ready for most of this, but it covers some pretty cool stuff, including RSA.
Generally in advanced calculus I avoided the discussion of, and much connection with, geometry, Stokes' theorem, and exterior algebra. So, I avoided Buck, Fleming, Spivak, and of course also, now in English, Henri Cartan, 'Differential Forms'. I even avoided the classic applied advanced calculus text, long used at MIT: Francis B. Hildebrand, 'Advanced Calculus for Applications'.
For Loomis and Sternberg, I agree with you, and have both the hard copy and the PDF.
Since I've mentioned such advanced calculus, I will try to save many students. Students, there's a secret: vector analysis, Stokes' theorem, etc. are important in physics and engineering; they will also be important in computing when computing concentrates on such physics and engineering. But still, mostly what you will find in physics and engineering is vector analysis much as it was done in the 19th century, which late 20th century math departments liked about as much as a skunk at a garden party.
If you read the modern treatments, complete with differential forms, then you will be at the head of the class in an advanced class in general relativity (e.g., Misner, Thorne, and Wheeler) but will still be lost in much of old physics and engineering!
So, what to do? Sure, go to Tom M. Apostol, 'Mathematical Analysis: A Modern Approach to Advanced Calculus', Addison-Wesley, Reading, Massachusetts, 1957. The good thing about this book is the lie in the title -- it's mostly a 19th century treatment and not "modern"! So do whatever you have to do to get a copy. And get the 1957 edition and NOT a more recent edition where he omitted the 'good stuff'!
Then, in about 20 pages of the sweetest dessert you ever tasted, with line integrals, conservative force fields, and potentials, volume and surface integrals, nice stuff like that, you will find a charmingly clear presentation of what you need. Right: The treatment is not up to the precision of Rudin and actually needs pictures. Still it's what you need for much of physics and engineering. It's, uh, 'intuitive' math; trying to make that material as precise as Rudin could take you, well, a long time.
And it's EASY -- you can take it in with a couple of beers and have a really fun evening. Then don't tell anyone where you learned it! Besides, at its core, it's just nice uses of the fundamental theorem of calculus you saw in freshman calculus! Did I mention, it's easy?
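That "nice uses of the fundamental theorem of calculus" remark can be made concrete. Here is a little numerical sketch (my own toy example, not from any of the texts above) of the fundamental theorem for line integrals: for a conservative field F = grad(phi), the line integral of F along a path depends only on the endpoints.

```python
# Toy check: phi(x, y) = x^2 * y, so F = grad(phi) = (2*x*y, x^2).
# Path: r(t) = (t, t^2) for t in [0, 1], from (0, 0) to (1, 1).
def F(x, y):
    return (2 * x * y, x * x)

def phi(x, y):
    return x * x * y

def line_integral(n=10_000):
    total, dt = 0.0, 1.0 / n
    for i in range(n):
        t = (i + 0.5) * dt        # midpoint rule
        x, y = t, t * t           # r(t)
        dx, dy = 1.0, 2.0 * t     # r'(t)
        fx, fy = F(x, y)
        total += (fx * dx + fy * dy) * dt
    return total

print(line_integral())  # ~= phi(1, 1) - phi(0, 0) = 1.0
```

Run it along any other path between the same endpoints and you get the same answer; that path independence is exactly what "conservative force field" and "potential" are about in those 20 pages.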
The key point about Rudin's 'Principles' is the care with which he covers the real numbers, compactness, continuity and uniform continuity, sequences and series, and the Riemann integral (yes, patched up with the Stieltjes extension which isn't much different). So, he concentrates hard on the foundations. For someone like the OP, getting those foundations solid is likely more important than rushing into many of the more famous topics in 'advanced calculus' -- Fourier series, the heat equation, Lagrange multipliers, vibrating strings (boundary value problems), the Navier-Stokes equations, series solutions to ordinary differential equations, etc.
While I like Rudin, 'cut many of my math teeth' on Rudin, and really like some of his treatments of some topics, I omitted some notes on how to read Rudin; some such notes could be helpful. In particular, Rudin has some places where it's easy to get stuck, and students should be advised not to get stuck (don't assume that just because you can't see how to solve some one exercise you must be missing something important) and, if necessary, just to look for other sources, ask for help, skip over and come back, or just f'get about it. Rudin was one of the best writers on his material, but he was not perfect; his writing varied, got easier to read as he wrote more, but is still relatively severe. Due to that severity, there have been some people, e.g., at Courant, who didn't like Rudin!
As much as I like the real half of his 'R&CA', he gets a bit severe and obscure in a few places (his novel and surprising but long and 'unstructured' construction of Lebesgue measure and his work on regular Borel measures); net, for most students it would be good to read Royden first or in parallel.
Rudin has two exercises that can slow people down: (1) every closed set is the union of a perfect set and a set that is at most countable, and (2) there are no countably infinite sigma-algebras. Both exercises require paying attention to what is countable versus uncountable. I worked on the first one about 14 hours a day for two weeks before someone mentioned 'uncountable', at which time I got it in about 90 seconds. The second one took me a long evening, but I was the only one in the class who got it. For the first one, Rudin eventually included the hint. Students: don't get stuck on such exercises.
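For exercise (2), the countable-versus-uncountable punch line can be sketched with the standard counting argument (which may or may not be the route Rudin intended):

```latex
% Sketch: no sigma-algebra is countably infinite. An infinite sigma-algebra
% on X contains an infinite sequence of nonempty, pairwise disjoint sets
% A_1, A_2, \dots. Each subset S of the naturals then yields a member
\[
  U_S \;=\; \bigcup_{n \in S} A_n , \qquad S \subseteq \mathbb{N},
\]
% and S \mapsto U_S is injective because the A_n are pairwise disjoint.
% So the sigma-algebra has at least 2^{\aleph_0} members: uncountable.
```

So a sigma-algebra is either finite or has at least the cardinality of the continuum; "countably infinite" is exactly the size it can never be.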
For Rudin's 'Functional Analysis', I nearly went to Brown's Division of Applied Math but at the last moment went to Hopkins instead. Brown was using Rudin's FA, so I got a copy and at Hopkins asked for a reading course in it. Alas, the prof had never heard of that book and declined to participate! So, Rudin's FA, along with his 'Fourier Analysis on Groups' or some such are still sitting new on my shelf as I write software!
But there are some risks.
First, broadly, a big risk is getting off track. There are many poor books; dig into one of those and you might never come out. And there are side streets; go down one of those and you might never get back to the main road. Don't do those things.
So, the lessons: Pick subjects carefully. In any subject, use only very highly recommended books, and use more than one in parallel although likely one book as your 'primary' source and the others as 'supplementary'.
For book recommendations, get those from a better source than Hacker News! I suggest getting recommendations from some of the best courses and professors in the departments at some of the world's best, and I mean top 20 or top 10, research universities. So, for a subject you want to learn, find what books such courses, profs, departments, and universities are using and recommending.
Second, another big risk is getting stuck. One way is to encounter an exercise you can't solve, believe that you are missing something important, and then just grind to a stop, for days, weeks, or forever, on that exercise. Don't do that.
(A) There are a lot of 'misplaced' exercises. Just because an exercise is in a book doesn't mean that it is reasonable to be able to solve it yet. Yes, in a good book you should be able to solve 90% of the exercises, but one book may have 1-20 exercises that are just nasty chuckholes in the road. I know one beautiful book where the exercises are so difficult, and apparently usually beyond the book, that one should get a prize for each doable exercise they find! (B) There are some tricky exercises that are just tricky and not really important. (C) There's no way you are going to miss much if you skip a few exercises. (D) You can continue on, maybe ask for help, look at your supplementary sources, etc. and then either get a solution or conclude that you haven't missed much. Yes, do work hard, but definitely don't get stuck. After enough hours at one spot, just move on.
Going on will sometimes give you the tools you need to resolve the place you were having trouble. Generally it's no sin, and sometimes helpful, to rush ahead quickly, get an overview, and then return to a more complete pass, and the overview can help you judge if the place you were having trouble was important or not. Also there's no way just one pass over some material can be as effective as three or so. E.g., one goal is a 'synthesis' of the material, and that needs several passes and several sources.
Third, there are occasional actual errors in the sources. So, if something doesn't make much sense, then it's wise to suspect an actual error someplace. You might be able to resolve the issue by consulting supplementary sources, doing some derivations on your own, say, to find a counterexample, or asking for help. By now the Internet may actually be a good place to ask for help. And, there's no law against writing the author of a book!
Fourth, some books are just much better than others. And even a good book might make a mess out of a few topics. So, again, consider several sources.
Fifth, there is a broad point about 'standards of quality'. Math is by a wide margin the most precise of all academic subjects. Still, in the total body of math that is done currently or already on the shelves of the libraries, the standards of quality vary widely.
Some of the most highly polished math ever put on paper is by (the collection of largely anonymous mathematicians) Bourbaki; very little of the rest of math is so highly polished; some of the best math and math writing is not as highly polished as Bourbaki.
So, there is a danger: it's possible to work to standards that are, in some or nearly all respects, too high. Beware: in the best work, the value is not the high polish but the real 'content' that would still be good just copied off a blackboard.
Why high polish? Because it can make one more immune to criticism. But, even Bourbaki gets criticized. Some criticism will always be there. So, don't try to defend against all possible criticism.
So, the 'standards of quality' that might be worth emphasizing would be the real 'content', the 'meaning', the 'significance', the 'power', the 'meat' and not secondary issues such as polish.
Contact with a good university department will give you some insight into what standards of quality you want to pursue, not too high, not too low, worthwhile and not just wasted effort, etc.
As I've written in this thread, it may be important for you to have a good teacher for abstract algebra since that is where your approach to math will take a big change, to how the rest of math works -- definitions, theorems, and proofs. You will need to learn how to get the understanding, intuition, 'conceptual models', helpful pictures, promising applications, the meaning, the 'meat', etc. out of presentations that look just abstract or like just abstract nonsense; of course, attempt to do such things only for what are without doubt high quality presentations.
You will need to learn how to write math with definitions, theorems, and proofs, and for that you may need a good prof to read and mark your homework.
If you tackle abstract algebra without a teacher, then beware of these 'side' lessons you need somehow to learn, and make some effort, likely not very large, to teach them to yourself. In particular, look at some beautifully written math, e.g., John von Neumann (e.g., his 'Quantum Mechanics'), Paul Halmos, Leo Breiman, and pay close attention to how they write.
All the warnings aside, if you are careful about avoiding the dangers, then you should be able to do well.
Let me suggest that, generally, some occasional contact with a good university department should help. E.g., if you work through Halmos's 'Finite Dimensional Vector Spaces', Rudin's 'Principles of Mathematical Analysis', MacLane and Birkhoff's 'Algebra', or Leo Breiman's 'Probability', along with supplementary sources for each, then maybe ask to sit in, totally informally, just to observe, on one or a few sessions of a corresponding university course. And get some homework assignments and some test copies. Then see if it appears you understood the material. You might be pleasantly surprised! Also, get old copies of the qualifying exams -- if these are easy for you, then you are doing well!
At least at one time, the Princeton math department, a good candidate for the best in the world, just stated that the graduate courses were introductions to research by experts, that no courses were given for preparation for the qualifying exams, and that students were expected to do such preparation on their own. Well, Princeton was stating that they expected students to do much like what you are trying to do, independent study.
There is a big theme: It's a good guess that what's valuable in math is creating new stuff. For a Ph.D. or tenure at a research university, research is the main work. For valuable applications to problems outside math, it may be that work that is at least a little new will be much more valuable.
Then, nearly always original work in math is done largely or totally independently. So, net, at some point, being good at independent work is crucial.