My colleagues have reported beam arrived and was detected by the CMS detector at 20:05 Geneva time! (CMS is one of the two big detectors as the Large Hadron Collider). Exciting times!
My colleagues have reported beam arrived and was detected by the CMS detector at 20:05 Geneva time! (CMS is one of the two big detectors as the Large Hadron Collider). Exciting times!
Yesterday night I attended a reception for the international scholars to UCSB. This was the first time that such a celebration took place in Santa Barbara, thanks to an anonymous donor. It was a lively event, and various important people from the UCSB campus showed up. I won’t bore you with the details.
Amongst the interesting facts that I collected yesterday, was that most international students in the US come to study in the science and technology fields, while very few people from the US go out to study and when they do, the statistic is mostly on humanities. I also found out that there are about 640000 students from abroad in US Universities, and that US Universities graduate about 30000 PhD’s annually.
Not surprisingly, current cuts in US University funding (especially state Universities) will hurt the efforts to get the best (graduate) students to the US, with the consequent deterioration of the pool of people with talent contributing to the US economy. An interesting article regarding this issue can be found here.
If you couple all of this progressive lack of investment by the states into the education system, the future starts looking pretty bad, not just for education, but for the US economy.
This year I am working on a University committee in charge of international education. So I have to learn quite a bit about this stuff.
Simple observation: if you lose loose the people from abroad (who are extremely good students), and you lose loose the local people (because they can not afford to go to school any longer), where is the human capital investment in the future development of technology going to come from?
Remember, modern economies depend on having the best and most innovative modern technology in order to compete. And, new ideas for technology come from people who know what the current technologies are, what they can do and how to make them. Ideas are not born from thin air.
Posted in Academia, Santa Barbara, science and society
I have just attended the XV European Workshop on String Theory. It took place in Zürich, Switzerland. Here is the link to the conference information and the talk slides. The conference featured a lot of young speakers who I had not met before. I found it very entertaining. Curiously enough, I really got to feel for the first time that different countries have their own way of doing physics. I think this reflects a bit on the attitudes of different cultures on what constitutes good taste. There is nothing wrong per se with any such preferences, it’s just an observation that I have heard many times before, but it had not really made itself so obvious to me as this time around. The menu of topics covered was rather formal and the hospitality of the event was incredible. I’m very grateful to Matthias and Matthias for providing such a welcoming environment. Here is a picture of Zürich (the best of the ones I took while wandering aimlessly through the streets of the city).
The weather was absolutely perfect and the town is truly a lovely place to loose oneself into.
Now, I’ll tell you a few anecdotes that happen occasionally to us, poor saps, who travel across the globe and find ourselves fighting this mysterious ailment called jet-lag. In particular, this is a story about myself.
I arrived at my hotel at about seven in the evening, not having slept well from the transatlantic flight from Los Angeles to Europe. I register and manage to get one or two hours of sleep. I woke up to the sound of the bells of the near clock tower at about 10 pm. Here is a picture, with a lovely moon casting its light on it.
It was a lovely bell tower with a beautiful clock and a rather loud set of bells. It rang every fifteen minutes. At the hour and fifteen it gave a short cry. At the hour and thirty, the cry of the bells was twice as long, and at forty five minutes it was three times as long. When the full hour came, it went extra long, and it finished with a lower rounder bell sound counting the hours. The clock rang all… night… long… My room was facing it directly.
Needless to say, first day of the conference, I was not better than a walking zombie. I’m sure you could hear me saying brains… brains… branes…
The second night, it was the same again. I couldn’t hold it past four in the afternoon, and after taking a nap, I woke at 10 pm again and couldn’t sleep a wink at night. Fortunately, I used the time wisely to prepare my talk. By the third night, I forced myself to stay awake and after a nice dinner organized by the conference and a little bit of wine, I finally slept a whole night and was alert the next day. The bells never bothered me again.
Of course, we all laughed together at the episode and I received various pieces of advice on how to improve the success of a 9 hour change of schedule for my body.
Another interesting thing to notice is the price of items. Europe has become expensive, but Switzerland is much more so than other places in Europe. Fortunately, some items are much cheaper than in the US. Ordering wine at restaurants is much more common, and the markup is not as high as in the US. So if one is trying to have a good time, the food costs more, and the wine costs less, so one can balance the two against each other. There are also these coins of 5 CHF. Big coins at that. You get a lot of them in change. And one shouldn’t leave them behind. If one is not paying attention, it is a lot of money in coins. They have a nice solid weight to them, so your pockets notice the extra effort they have to make. This fact alone should give you an idea of the value of items. In the US, people don’t even like using one dollar coins. If they get them in change they collect them. I never understood why they don’t circulate much.
Finally, the Swiss train system is very nice, and dense. This made me think of all of Einstein’s special relativity trains running at great speeds in various directions and I can definitely see where his obsession with clocks and synchronization came from. I experienced them first hand: both the clocks, and the synchronization of events.
Posted in Conferences, string theory, travel
There was a discussion about displaying math on the web, over at Terry Tao’s blog. It was a little disheartening to learn where most peoples’ heads are at, in that regard. Many seems to be excited at the mere prospect of creating a web service that would turn TeX equations into pictures. Such services are not exactly thin on the ground, I retorted, but hardly satisfactory, either. Fortunately, an actual blind user showed up to explain at least one reason why that wasn’t the answer.
Still, being told, “You should use MathML.” is like being told, “Eat your vegetables.” and is just about as likely to be heeded.
One objection is the lack of browser support, which – these days – means that neither Opera, nor Webkit browsers (Safari, Chrome, …) support MathML. The latter, however, seems to be changing. There’s a plan and, more importantly, there’s actually been code checked into trunk.
Of course, it’s far too early to expect anything usable, but if Alex wants to prioritize, here’s a table of MathML elements and attributes used by itex2MML. Things that aren’t yet supported by his patches are marked in bold.
| Element | Attributes |
|---|---|
| math | display |
| mi | mathvariant |
| mn | |
| mo | maxsize minsize lspace rspace stretchy |
| mrow | xmlns:xlink xlink:type xlink:href |
| mover | |
| munder | |
| munderover | |
| msub | |
| msup | |
| msubsup | |
| maction | actiontype( = toggle, statusline, tooltip) |
| mphantom | |
| mpadded | width lspace |
| mspace | height depth width |
| merror | |
| mtext | |
| mstyle | scriptlevel mathcolor mathbackground displaystyle mathvariant |
| mmultiscripts | |
| mprescripts | |
| none | |
| mfrac | linethickness |
| msqrt | |
| mroot | |
| mtable | rowalign columnalign align rowspacing columnspacing equalrows equalcolumns rowlines columnlines frame |
| mtr | rowalign columnalign |
| mtd | rowalign columnalign rowspan columnspan |
| semantics | |
| annotation-xml | encoding |
Other bits of the conversation were a little more heartening. MathML-in-text/html support is in Firefox nightlies, though disabled by default. I don’t imagine it will be turned on before Firefox 3.7 at the earliest. Judging by the number of Firefox 3.0 users, it’ll be some years yet, before we can count on using MathML-in-text/html. And I did learn about the very-cool-sounding MathJax project.
Anyway, here’s a modest contribution to what the discussion could have been about: things you can do with math on the web, that you couldn’t do in print. Hover your mouse over the right-hand side of
Unfortunately, due to a year-old Mozilla bug, the perhaps more-useful
doesn’t currently work.
These are created with itex’s \statusline{}{}, \toggle{}{} and \href{}{} commands.
\tooltip{}{} command:
Specifically, the itex code is
A=\toggle{ \tooltip{Click to hide.}{ \tfrac{2}{\sqrt{\pi}}\int_0^x e^{-t^2} d t} }{ \tooltip{The Error Function: click for the definition.}{ \mathop{erf}(x)} }
Today at tea, some grad students were discussing the following enumeration problem:
How many elements of
have zeroes in all diagonal entries?
I think they [Redacted]. The answer is apparently known but classified. It’s a sort of q-analog of derangements (i.e., permutations without fixed points), but if you take the derangement formula and add q-numbers in the naive way, the formula doesn’t seem to work for n > 2.

This is the first of a few posts I plan (one other of which is written and another of which is in draft form but in need of a few changes) in which I discuss various Polymath proposals in more detail than I did in my earlier post on possible projects.
One of my suggestions, albeit a rather tentative one, was to try to come up with a model that would show convincingly how life could emerge from non-life by purely naturalistic processes. But before this could become a sensible project it would be essential to have a more clearly defined mathematical question. By that I don’t mean a conjecture that Polymath would be trying to prove rigorously, but rather a list of properties that a model would have to have for it to count as successful. Such a list need not be fully precise, but in my view it should be reasonably precise, so that the task is reasonably well defined. It would of course be possible to change the desiderata as one went along.
In this post I’d like to make a preliminary list. It will undoubtedly be unsatisfactory in many ways, but I hope that there will be a subsequent discussion and that from it a better list will emerge. The purpose of this is not to start a Polymath project, but simply to attempt to define a Polymath proposal that might at some future date be an actual project. For two reasons I wouldn’t want this to be a serious project just yet: it seems a good idea to think quite hard about how it would actually work in practice, and someone who I hope will be a key participant is very busy for the next few months and less busy thereafter.
As a starting point, let me mention two ideas that are already out there and have attracted a lot of attention. One is the idea of cellular automata. A fairly general type of cellular automaton can be defined as follows. You have a graph (usually something like an infinite two-dimensional lattice), and at some points you have 1s and at other points you have 0s. You then let the system evolve in rounds according to some simple rule that is usually the same for every vertex. It might be something like this: if at least two of my neighbours are 1s then I will become a 1, and otherwise I will become a 0. It turns out that very simple rules can lead to extremely complicated and interesting behaviour.
What counts as complicated and interesting? Well, perhaps it is better to say what counts as dull. One possible form of dullness is if a system evolves to some state such as the all-1s state, or perhaps a big rectangle full of 1s with 0s outside, or an oscillation between two configurations. Another form of dullness is a system that tends to disperse the 1s until they form some fairly random looking bunch of 1s that never stops looking fairly random. But in between, there are systems that tend to evolve towards some kind of criticality, where you get fractal structures with organization at many different distance scales. One thing that interests people about cellular automata is that there are very simple rules that seem to want to evolve towards these nice “edge of chaos” patterns.
The second idea is self-organized criticality, which is a phenomenon exhibited by certain models in statistical physics — notably the so-called sandpile models. These are supposed to model what happens if you drop grains of sand one by one on to a pile. They will start to build up into a conical shape, but if the sides get too steep there are avalanches. The sizes of these avalanches vary, and if you plot the frequency of avalanches of various sizes, you find (experimentally at least) that they obey a power law. And power laws get people excited because they are what you find associated with critical phenomena. A typical sandpile model is something like this. You have a big square divided into a grid of small squares. You then set all squares equal to 0 except a few randomly chosen ones that you give small integers to. You then add 1 to the central square (let’s assume there is one), and after you have done so you have a rule that says that if any square has value at least 4 it must give 1 to each of its neighbours. This procedure you iterate until no square has value at least 4. (It can be shown that the order in which you do these operations doesn’t matter.) You then add 1 to the central square again, and keep going.
It turns out that the sizes of the “avalanches” that take place here (that is, how many iterations you have to do of the simple rule before all squares have value 0 to 3) also obey a power law, and also that systems such as these have a tendency to evolve towards interesting (that is, not too random and not too structured) configurations. That is, you can get critical phenomena without having to fine-tune some parameter. Again, this has got people excited as it seems to promise an explanation of how the complexity in nature could have started.
In the above description, I made the starting configuration random but after that the way the model evolved was deterministic. There are of course many different possible models, and in some of them the new “grains of sand” are dropped in random places. Again you get interesting critical behaviour.
Now as far as I know, with both cellular automata and sandpile models you get nice critical phenomena appearing, but while they give you pretty patterns they do not give you anything resembling an ecosystem. Yes, Conway’s game of life gives you glider guns and configurations that can reproduce themselves, but you have to set them up carefully in advance, and they don’t seem to do anything all that exciting. They also support universal computation, but again if you want to program the game of life to create an artificial-life simulator, you might as well use a much more powerful computer to do so. What a Polymath project would be looking for is a very simple system with the property that, regardless of the starting configuration, it would tend to develop and eventually produce something that looked like a complex ecosystem.
This brings me to a point that is worth making. The idea of this Polymath project would not be to produce yet another artificial life program, fascinating though those programs can be. One could think of it more like this: can one come up with a very simple model that almost always “self-organizes” and produces something that looks a bit like what you get with artificial life programs? In other words, we would be trying to model abiogenesis rather than evolution.
After that discussion, I think I can have a stab at saying what the properties are that would make a truly interesting and new model. (I am much less sure about the “new” part, and would be interested to hear from people with more knowledge about this kind of topic what the state of the art is.) Some of the properties below seem to be more important than others, but for now I won’t bother to distinguish between those that I regard as essential and those that are merely desirable.
1. It should be a dynamic model that evolves according to simple rules.
2. It should have a tendency to evolve towards patterns with a “critical” character — not too random and not too simple, with interesting features at many distance scales.
3. Probably it should be a somewhat randomized model (to give it a certain robustness). Here I am referring to the rules by which the model develops rather than the initial conditions, but perhaps the initial conditions should be randomized as well.
4. It should have a tendency to produce identifiable macroscopic structures.
5. It should be possible to classify these macroscopic structures in interesting ways. (That is, we would like to be able to say that certain structures look more or less the same as certain others, and ideally this similarity would be a bit more flexible than one just being a translation of another.)
6. These structures should interact with one another, and the interaction should sometimes be destructive (thereby providing some selection pressure).
7. With high probability, self-reproducing structures should eventually emerge. (Before posting this I showed it to Michael Nielsen, who made some interesting points. One of them is that experience in the actual universe suggests that perhaps there should be some fine tuning of parameters before the probability becomes high: after all, life does not evolve on all planets.)
I could go on, but the idea is that once you’ve got 6 and 7, and perhaps a few other properties (for instance, one might decide to have major environmental changes from time to time just to stimulate the development of the system), then natural selection can begin to operate.
Of course, the major challenge is 7. The most plausible route I can see to 7 is a purely probabilistic one: almost all configurations are not self-reproducing, but if a self-reproducing one ever does arise, then it will reproduce itself and start appearing all over the place. But in that case 5 is also a huge challenge. The kinds of structures one would ideally like are not things like the bullets from Conway’s glider guns, but larger configurations that can move about and that are defined more topologically. Indeed, that could be a huge and general problem: the geometry of just isn’t the same as the geometry of
, but a continuous model would be very difficult to design and simulate (or would it?). But perhaps there could be some cleverly chosen simple rule that would tend to protect “clumps” of 1s and allow them to move, and to do complicated things like rotating (whatever that can be made to mean in
). Or perhaps a complicated ecosystem could develop that was more
-like than
-like.
Here, incidentally, is a paragraph from the Wikipedia article on Conway’s game of life, which shows that it is not already an example of what I am talking about:
From a random initial pattern of living cells on the grid, observers will find the population constantly changing as the generations tick by. The patterns that emerge from the simple rules may be considered a form of beauty. Small isolated subpatterns with no initial symmetry tend to become symmetrical. Once this happens the symmetry may increase in richness, but it cannot be lost unless a nearby subpattern comes close enough to disturb it. In a very few cases the society eventually dies out, with all living cells vanishing, though this may not happen for a great many generations. Most initial patterns eventually “burn out”, producing either stable figures or patterns that oscillate forever between two or more states; many also produce one or more gliders or spaceships that travel indefinitely away from the initial location.
We would be looking for something a bit like the Game of Life, possibly randomized, with the important difference that it almost always got more and more complicated and more and more interesting.
Physics
There is one other property that I think would make a model more convincing as an argument for the probability of life arising out of non-life without any magic processes operating. I partly owe this thought to Michael Nielsen, who included the following two questions in a comment he made on the post where I originally mentioned this problem.
(1) How would you go about recognizing self-replicating beings?
(2) What sort of models are “reasonable”, in the sense of both reflecting what we know of physics, and being simple enough to be tractable? The Game of Life isn’t very physical, in that it disobeys many basic physical principles, like conservation of energy, conservation of mass, conservation of momentum, and so on.
One of the things that people often say about life, evolution, biological systems and the like is that they are ways of locally combatting the second law of thermodynamics. So perhaps one could add the following property as one that it would be very nice to have.
8. The general tendency for the model is to become more and more disordered, and eventually to end in heat death, but for there to be many local increases in order.
Of course, one would need to be clear what that meant. The other physical principles that Michael mentioned would also be good to have.
Here is a subproblem that occurs to me as I am writing this. It is connected with the thought that one would like macroscopic structures to have some tendency to survive. In the Game of Life, it seems that structures that survive do so almost by accident — they settle down into some sort of periodicity, say. But structures in the biological world are held together by physical forces, and they have identifiable boundaries and things like that. So one might try to develop a model that captures just this behaviour. As with the main problem, I’m not sure how to formulate this subproblem precisely, but let me have a go. Does there exist a model with the following properties?
(i) If you draw some large-scale shape (think of the 0s and 1s as black and white pixels, say, so the shape is on a much larger distance scale than the distance between two neighbouring points of the grid), it has a tendency to move “continuously”.
(ii) There is a tendency for mass and momentum to be conserved.
To give an idea of the kind of thing I mean here, let’s suppose that “mass” is represented by 1s, and you take a large annulus, place it over , and put a 1 at every grid point that lies in the annulus. Then in the interior circle of the annulus put a random scattering of not too many 1s. And then slightly move the annulus part, and slightly move all the little particles inside. If the first position of the annulus represents where some very simple structure is at time 1 and the second where it is at time 2, then conservation of mass and momentum would tell us to expect it to continue moving in the same direction (so it would be more sophisticated than a cellular automaton of the kind described earlier because its behaviour would depend not just on how it behaved an instant earlier), and to stay the same size. We might also have “forces” between neighbouring 1s that encouraged them to stay together somewhat, and so on.
Of course, as with Conway’s Game of Life, the idea would be to devise the simplest possible set of rules that did what one wanted (in this case preserve macroscopic shapes at least to some extent and allow them to move about reasonably flexibly but without distorting themselves too much). It would not be to try to create the most realistic model one could of the actual world.
Since writing the above paragraphs I’ve found out the following relevant facts. First this from the Wikipedia article on Life-like cellular automata:
Larger than Life is a family of cellular automata studied by Kellie Michele Evans. They have very large radius neighbourhoods, but perform `birth/death’ thresholding similar to Conway’s life. The LtL CA manifest eerily organic `glider’ and `blinker’ structures.
RealLife is the “continuum limit″ of Evan’s Larger Than Life CA, in the limit as the neighbourhood radius goes to infinity, while the lattice spacing goes to zero. Technically, they are not cellular automata at all, because the underlying “space” is the continuous Euclidean plane R2, not the discrete lattice Z2. They have been studied by Marcus Pivato.
Secondly, here is the paper by Marcus Pivato mentioned above.
Chemistry and the problem of scale.
By far the most famous contribution to our understanding of how life started is the Miller-Urey experiment, in which Miller and Urey attempted to simulate the chemical conditions that might have prevailed early on in the life of the Earth. They used electrodes to create lightning-like sparks that passed through a vapour that was formed of water, methane, ammonia and hydrogen, and found that they produced complex amino acids, which are essential building blocks of life.
What relevance would this experiment have for a computer simulation? My view is that one should not necessarily try to produce a virtual Miller-Urey experiment (complete with virtual lightning, virtual ammonia, etc.) but that the experiment does raise a couple of questions that it is essential to address.
A fundamental fact about life as it exists in the physical world is that it is carbon based. The great virtue of carbon is that its particular bonding properties allow it to combine with other atoms to form molecules that are large and complicated enough to encode highly sophisticated information. So an obvious question is this.
Question 1: Should one design some kind of rudimentary virtual chemistry that would make complicated “molecules” possible in principle?
The alternative is to have some very simple physical rule and hope that the chemistry emerges from it (which would be more like the Game of Life approach).
This is just one example of a general tension. The more features you design into a model, the less “universal” it becomes and the less convincing it is as a demonstration of the inevitability of life. However, one can also argue for at least some designed features. After all, if we want to explain the origin of life, it is not necessary to start with a virtual Big Bang and get from there to the possibility of complex molecules. It may be that designing rules to make complex molecules possible (and then arguing that with probability 1 this possibility is actually realized) is attacking the problem at the correct level.
I do not have a strong view about what the right answer to this question is. Obviously I would prefer the chemistry to emerge as if by magic, but that may be an unrealistic hope.
The second question does not arise directly out of the Miller-Urey experiment, but it is related.
Question 2: How large and how complicated should we expect “organisms” to be?
A real-world organism, even a micro-organism, is made out of more atoms than one could hope to simulate on a computer. (I am not certain that that last sentence is correct, but I would be very surprised if it wasn’t. Added later: Michael Nielsen tells me that there are rudimentary organisms that are so small that they could perhaps be simulated in full.) Moreover, although it has many levels of complexity, there will also be distance scales at which it is relatively simple. For example, if I look at my hand from a distance of about a yard, my skin looks smooth. Similarly, if I were to look through a powerful microscope at one of the cells of my hand, then the boundary of that cell would be reasonably smooth, rather than fractal-like. In general, it seems that if you look at a typical organism, it is not equally complicated at all distance scales, but is more like this: you take some small objects and put them together in a reasonably simple way to form bigger objects; you then use these bigger objects as building blocks for yet bigger objects; continuing this process for eight or nine (??) levels (perhaps if I knew more biology I would revise this number up considerably) you end up with a complex organism.
If that picture is roughly correct, then the number of “atoms” in a complex multicellular organism might be prohibitively large for a simulation. Is this a problem?
I think it shouldn’t be too problematic. Just as we are not trying to start with the Big Bang, neither are we trying to end with mammals. The main aim is to get to the point where evolution can take over. In particular, if a readily identifiable micro-organism appeared that could reproduce itself with small modifications, then the simulation would surely be declared a success.
Nevertheless, the question of scale remains. Would we want such a micro-organism to consist of a small handful of “pixels” that by some magic local rule gives rise to a copy of itself? Or would we want something much larger that had “smooth boundaries” at some distance scales and was composed of “complex molecules”? My inclination at the moment is to prefer the second for two reasons: it is less like the Game of Life (and therefore more likely to be novel and interesting) and it is closer to the life forms that we actually observe.
Added later: I haven’t quite made clear that one aim of such a project would be to come up with theoretical arguments. That is, it would be very nice if one could do more than have a discussion, based on intelligent guesswork, about how to design a simulation, followed (if we were lucky and found collaborators who were good at programming) by attempts to implement the designs, followed by refinements of the designs, etc. Even that could be pretty good, but some kind of theoretical (but probably not rigorous) argument that gave one good reason to expect certain models to work well would be better still. Getting the right balance between theory and experiment could be challenging. The reason I am in favour of theory is that I feel that that is where mathematicians have more chance of making a genuinely new contribution to knowledge.

We subscribe to Locus, the SF review and news magazine, and every month when it arrives, I flip through it quickly to look at the ads. This is a useful guide to what's coming out from various publishers, but it's also kind of fascinating to see how the different publishers market their stuff.
In particular, it's interesting to see how Baen pitches their books, because they are aimed with laser-like precision at people who aren't me. I'm sure their ads work very well for their target audience, but they make their forthcoming books sound absolutely horrifying to me. This month's ad may be the ultimate, featuring the following plug for a Tom Kratman book:
Undercover Infidels!
Europe 2123. Dhimmitude-- assigning second-class citizenship to non-Muslims-- has dug its claws into the continent. Now a West Point grad must rescue a young girl sold into sexual slavery after her family could not pay the Christian Tax. U. S. Army vertean Tom Kratman notches up another controversial thriller!
If that didn't scream "Run away!!!" loudly enough, there's a glowing quote from Mark Steyn. I'm not sure it would be possible to construct a more appalling book ad.
Oh, wait, I stand corrected: later in the same issue, there's a plug for The Science Behind the Secret, featuring a quote from noted con man tv psychic John Edward claiming that quantum mechanics is the basis for the "Law of Attraction." This is also a Baen book, coming in March. And it pretty much guarantees that even if they publish something I might like (hey, it could happen), I'll be getting it from the library, rather than giving them any of my beer money.
In particle physics this is what we do. We have petabytes (1000 terrabytes!) datasets consisting of billions of physics interactions. For the particularly rare ones we need to pick out several 100 or 1000 and study them in detail. As you might expect, we are drowning in data and have developed many tools to help us. Computers are central – without them we would not be able to do the science we currently do!
The most common public example of data mining I’ve heard about is looking at all the receipts from Wallmart purchases. This is why grocery stores like you to sign up for their frequent-use cards – they can track everything you buy, sell that data, and, more importantly, send you ads that are likely to get you in and get you to buy other things. It is an amazingly powerful tool. In business it has been getting a bit of a bad name recently because it has been connected to some fairly serious invasion of privacy issues (i.e. creepy things – like knowing what hour of the day you check your email, how much the average person in your zip code makes, etc.).
But one place that it could obviously be applied for the greater good that I’d never really given much thought to is medicine. Check out this long article from the NYT on the topic – Making Health Care Better. It starts with some history – and the bromide “The amount of death and disease would be less if all disease were left to itself.” from 1835… to the present day:
“Medicine adopted the scientific method,” James said… “It transformed medicine, and it’s easy to make the case.”
He talks about the testing and science applied to any new method, drug, procedure before it is allowed to be used by mainstream doctors. But…
But there is one important way in which medicine never quite adopted the scientific method… …once a treatment enters the mainstream — once we know whether it works in certain situations — science is largely left behind. The next questions — when to use it and on which patients — become matters of judgment, not measurement.
The article provides a dizzying array of treatments available to a doctor that is trying to treat heart disease. And what treatment is left up to the doctor’s judgment.
Cleary some hospitals and doctors have better average outcomes than others – so some doctors must have better judgment than others. Wouldn’t it be great if every doctor could start with a default procedure that has been shown to work for a patient that looks like the one the doctor is trying to treat and then modify it to fit the patient’s specifics?
Well…
“I thought there wasn’t anybody better in the world at twiddling the knobs than I was,” Jim Orme, a critical-care doctor, told me later, “so I was skeptical that any protocol generated by a group of people could do better.”
And that is just it – there are so many variations in treatments. Today’s instruments are quite complex and have many settings – so how do you know what works correctly? When the procedure is approved there is a fair amount of science recorded for each setting, and presumable most doctors follow it. But not all of them!
And this is where I think data-mining could come in. What if every single modern instrument was hooked into the network, and each adjustment was recorded? And linked to a patients medical file (so you could see history). Each time a nurse or doctor did something it was recorded. All of that in some standard format – and then shared across hospitals and doctors the country or world over?
This has, of course, been a dream for a while of electronic health care records. It always struck me as obvious that you would attach x-rays, CT scans, descriptions of medicine given, etc., but it never occurred to me the level of detail you could go into! From a technical point of view this is hard – the data is so non-uniform, unlike the particle physics experiments I work on, but the long term benefits could be quite good. The article describes when this data mining technique was applied ad-hoc in just a single hospital:
One widely circulated national study overseen by doctors at Massachusetts General Hospital had found an ARDS [Acute respiratory distress syndrome] survival rate of about 10 percent. For those in Intermountain’s study, the rate was 40 percent.
At any rate, this tickled my fancy, which is why I wrote about it. I found it ironic that on the Health home page yesterday there was also the following article:
Five years later, Medicare underwrites more than half of the $4 billion the nation now spends annually on defibrillators, but the agency is no closer to knowing how many lives that big investment is saving.
My impression of the health care bills working their way through congress right now is none of them really go after cost-savings. Science can help*.
* Ok – making devices that can spit out data in a common format will add to their cost. But you can do simple things like the Intermountain study to start as better devices come online!

I'm organizing a workshop!
. But the same electrons should also produce gamma rays when interacting with the interstellar light in the process known as the inverse Compton scattering (Inverse Compton was the younger brother of Arthur), the ICS in short. The claim is that Fermi has detected these ICS photons. You can even see it yourself if you stare long enough into the picture.
that does not fit a simple power-law spectrum expected from the background. The paper says that a dark matter particle of mass around 30 GeV annihilating into b quark pairs can fit the bump. The required annihilation cross section is fairly low, of order $10^{-25} cm^3/s$, only a factor of 3 larger than that needed to explain the observed abundance of dark matter via a thermal relic. That would put this dark matter particle closer to a standard WIMP, as opposed to the recently popular dark matter particles designed to explain the PAMELA positron excess who need a much larger mass and cross section.
decays into a many-body final state, which is possible in some extensions of the standard model. One popular theory where this could happen is the NMSSM - the 2.0 version of the MSSM with an additional singlet. Roughly, the Higgs could first decay into the new singlet, who in turn decays into two tau leptons, which amounts to Higgs decaying into four tau leptons. This funny decay topology could escape LEP searches even if the Higgs is as light as 86 GeV! That is the case not because of deep physical reasons, but simply because LEP collaborations were too lazy to search for it (in comparison, Higgs decaying into four b-quarks, which was studied by LEP, is excluded for the Higgs mass up to 110 GeV).
So, is the idea of the hidden light Higgs dead? It has definitely received a serious blow, but it can still survive in some perverse models where Higgs decays into four light jets, at least until someone ventures to kill that too. Anyway, never say dead; there is no experimental results that theorists could not find a way around ;-)In the morning, Jeremy Tinker (Berkeley) led our group meeting with a discussion of information about galaxy evolution from clustering. In the approximation that we know the dark matter model, the relationship between galaxies and dark matter can be parameterized and then the observed galaxy—galaxy clustering puts constraints on how the galaxies could possibly form and evolve. He has some counterintuitive results, from the fact that at intermediate redshift, the large-scale clustering of red and blue galaxies is very similar.
In the afternoon, Marc Kamionkowski (Caltech) gave the Big Apple Colloquium about the isotropy and homogeneity of large-scale structure, and in particular the cosmic microwave background. He is building non-natural models that permit anisotropy in the power spectrum while preserving isotropy in the temperature and density and all else. There is a small amount of evidence for this statistical anisotropy situation in the current data; it is a long shot but if it holds up it is extremely important.
This week, Henry Towsner concluded his portion of reading seminar of the Hrushovski paper, by discussing (a weaker, simplified version of) main model-theoretic theorem (Theorem 3.4 of Hrushovski), and described how this theorem implied the combinatorial application in Corollary 1.2 of Hrushovski. The presentation here differs slightly from that in Hrushovski’s paper, for instance by avoiding mention of the more general notions of S1 ideals and forking.
Here is a collection of resources so far on the Hrushovski paper:
— 1. Theorem 3.4 —
Here is a weakened version of Hrushovski’s Theorem 3.4:
Theorem 1 Let
be a countable model of a language extending the language of groups, with a universal extension
. Let
be a continuous, invariant Keisler measure that is also invariant under translations. Let
be a symmetric definable subset of
with
and
finite, and let
be the group generated by
. Let
be a wide type over
contained in
, and suppose that for every
, there exists
such that
and
are both wide.
Then the set
is a normal subgroup of
.
Remark 1 The condition about
and
seems to be a statement that there exist plenty of pairs
in
that are in “general position” somehow. In the case of abelian groups, it seems that this hypothesis not necessary. The hypothesis is for all
, but by homogeneity it suffices to verify it for a single
(since the action of the automorphism group of
is transitive on
).
Remark 2 Interestingly, it seems that no doubling condition on
is needed to prove this theorem, but the doubling condition arises when one wants to put
to use.
Remark 3 As
is wide,
is wide also. If we also assume that every finite product
has finite measure, this leads to the consequence that the index of
in
does not exceed
in cardinality (i.e. has bounded index, see Lemma 1 of Notes 2). Indeed, if this were not the case, then one could find more than
disjoint cosets
in
, and hence in some finite product
. By Section 4 of Notes 2,
is
-definable, and so is the intersection of countably many definable sets
. For any distinct
,
are disjoint, thus by saturation we have
non-empty for some integer
. By the Erdös-Rado theorem (an infinitary analogue of Ramsey’s theorem), we can then ass to a countable set of indices
in which the
are constant, thus we have a countable set of disjoint translates
, which live in some finite product
. But as
is wide,
has positive measure, while
has finite leasure, leading to a contradiction.
Remark 4 The set
can also be defined in a more “characteristic” fashion as the smallest
-definable subgroup of
of bounded index; we’ll return to this point in a later note (I think).
Remark 5 Here is one example of how the group
emerges. Start with a finite combinatorial model, in which
is a discrete interval
and the group is the integers; we normalise
to have measure
. Taking ultralimits, we end up with a non-standard discrete interval
in the non-standard integers, with
being the counting measure normalised to give
a measure of
. (This is not yet a saturated model, but yet us ignore this issue.) Note that the subsets
are definable for any standard natural number
. Their intersection
is a wide subgroup; if
was a “generic” type in
then
would be wide, and
would generate
(this is vaguely reminiscent of the “Bogulybov theorem” in additive combinatorics).
To apply Theorem 1, we need the following auxiliary lemma (a weakened version of Hrushovski Lemma 2.15):
Lemma 2 Let
be a definable set of positive measure. Then we can refine
to a wide type
such that for every
, there exists
such that
and
are both wide.
This lemma can be viewed as a more technical variant of Lemma 2 from Notes 2.
To prove this lemma we need a technical sublemma:
Lemma 3 Let
be a definable set of positive measure, and suppose that the product
is covered by two definable sets
(these are sets of pairs, of course). Then one can refine
to a definable subset
of positive measure, such that at least one of the following holds:
- (i) For every
, the set
has positive measure; or
- (ii) For every
, the set
has positive measure.
Proof: We can refine to be contained in
. At least one of
must have positive measure; let’s say that
. We can find a definable set
between
and
. If any of these
has positive measure, then we can just take
, so we may assume instead that
for all
, which implies that
by Fubini’s theorem (which can be extended to Kiesler measures, at least in the weak form we need here), giving the desired contradiction.
Now we prove Lemma 2. By starting with and refining repeatedly using Lemma 3 (enumerating all the definable sets,
and pairs of definable sets
, so that each pair is revisited infinitely often) one can find a type
refining
with the property that whenever
is a definable set containing
, and
is covered by two definable sets
, then either
has positive measure for all
, or
has positive measure for all
. In particular (setting
and
empty) this implies that
is wide.
Now let , and define
to be the type
, refined using the complement of all the sets of the form
for definable
with
, and the complement of the sets of the form
for definable
with
(note the asymmetry here!). We claim that this collection of sentences is still finitely satisfiable. Indeed, if this were not the case,
would have to be covered by
for some definable
with
. Write
, then the set
is definable and contains
, and thus contains
. If we set
, we thus see that
is a definable set containing
and
, contradicting the construction of
.
As is finitely satisfiable, we may extend it to a type
over
. Now let
be any realisation of
. We claim that
is wide. For if not, then
would be contained in a set
of measure zero for some definable
, but by construction
and hence
is contained in the complement of
, a contradiction.
We similarly claim that is wide. For if not, then
is contained in a set
of measure zero for some definable
. since
and
have the same type over
,
has measure zero also, and so
is contained in the complement of
by construction. In particular,
lies outside
, which is inconsistent with
lying in
. This proves Lemma 2.
Remark 6 As I understand it, the argument simplifies in the stable case, when the wideness of
over
is equivalent to the wideness of
over
.
— 2. Corollary 1.2 —
Assuming Theorem 1, we now establish the following combinatorial consequence, which is (a slightly weaker version of) Corollary 1.2 of Hrushovski:
Corollary 4 Let
be positive integers. Then for
sufficiently close to
, the following statement is true: whenever
is a finite subset of a group
with the bounded quintipling property
, where
, and for a proportion at least
of the
-tuples
, one has
(recall that
). Then there exists a subgroup
of
such that
is contained in at most
cosets of
.
We have weakened things a bit here by drawing the from
rather than
(and also weakening bounded tripling to bounded quintipling); this is in order to achieve some minor simplifications in the proof.
We now prove the corollary, by the usual “compactness and contradiction” method. If the corollary failed for some , then one can find a sequence of groups
with subsets
of tripling uniformly bounded by
, such that the proportion of
-tuples
in
with
approaches
, but such that there is no subgroup
of
which can cover
by
cosets.
Taking an ultralimit (and using counting measure normalised so that has mass
), we obtain a subset
of a group
and a continuous, invariant, translation-invariant Keisler measure
so that
,
, and such that the set
(say) has measure
. (Strictly speaking, one has to replace
by a definable predicate between
and
, but let’s ignore this technicality.) We then extend
to a universal model
.
The set has positive measure (at least
, in fact), so by Lemma 2, we may refine it to a wide type
obeying the hypotheses of Theorem 1, so that the set
is a group. Note that this set is contained in
.
Since contains the wide type
, it is itself wide. By Section 4 of Notes 2,
is
-definable, and is thus the intersection of definable sets
, which have positive measure as
is wide; we can place this inside the definable set
. As
has full measure in
, we see that
intersects
for every
, by saturation,
must also intersect
. In particular, there exists
in
such that
On the other hand, as is normal in
, the set
lies in
. This implies that we cannot find more than
disjoint translates
,
. As a consequence,
is covered by at most
translates of
, and so
can be covered by
such translates, including
itself. In particular,
is both
-definable and
-definable (it is the complement of all the other cosets in
). But any set
which is both
-definable and
-definable has to be definable. (Proof: we can write
as the intersection of definable sets
and also as the union of definable sets
. By saturation,
must be empty for at least one
, and the claim follows.)
To summarise, is now a definable group which can cover
by at most
cosets. This is a first-order statement and so descends back to the original finitary models, but this then contradicts the construction of such sets, and proves the Corollary.
Posted in Logic reading seminar, math.CO, math.LO Tagged: definability, Ehud Hrushovski, henry towsnerRemark 7 The argument not only shows the existence of the set
, but shows that this set is “definable”, in the sense that there are a finite number of first-order expressions using
and normalised cardinality, one of which is guaranteed to generate
whenever the hypotheses of Corollary 4 are satisfied. Indeed, if this were not the case, one could construct a sequence of counterexamples for which no such recipe for constructing
would work for sufficiently late elements of this sequence, and then run the above argument to obtain a contradiction.

I get a lot of press releases forward to me which usually get forwarded directly into my gmail archive. But this one I'm happy to pass along: Third Man Records is releasing A Glorious Dawn. You know the Carl Sagan remix (w/ guest appearance of the Hawkmeister) that I've been looping over and over again while I work:
Third Man Records is over the moon to announce the 7-inch release of "A Glorious Dawn" on November 9th.I fell asleep last night listening to an episode of Cosmos. Maybe that explains da alienz in my dreamz? Read the comments on this post...
...
The release is timed to coincide with the 75th anniversary of Sagan's birth. Also happening that day is a reception in United States' Congress with speeches by senators, NASA officials and assorted scientists, all hosted by the Planetary Society, which was co-founded by Sagan.Third Man Records, in conjunction with United Record Pressing, fabricated a special "Cosmos Colored Vinyl" of which 150 copies will be available...50 randomly inserted into mail orders for "A Glorious Dawn" and the remainder to be made available at the Third Man Records Nashville store front at noon on November 9th.
The one-sided single features a very special etching on the flipside. Reproduced from the original artwork, the etching copies the etching included with the Voyager Golden Record, set off into space in 1977 as the most elaborate message-in-a-bottle idea ever imagined. With its inclusion of Blind Willie Johnson's "Dark Was The Night" it goes without saying that the Voyager Golden Record is one of Third Man's favorite releases of all-time..
Lead paragraph from the Times Online UK about the latest LHC snafu:
The rehabilitation of the beleaguered Large Hadron Collider was on hold tonight after the failure of one of its powerful cooling units caused by an errant chunk of baguette.
"It's a question of character, of friendship. Hell, Leo, I ain't afraid to say it, it's a question of ethics." --Giovanni Gaspari
I'm back to lunchtime hoops after a two-week layoff due to teaching responsibilities. And this has reminded me of one of the great character tests that sports provide. Imagine that you're playing basketball, but are too tired to keep running with the fast break in both directions. You can't quit without pissing everybody off, though, and there's no-one you can have sub in for you. What do you do?
<a href="http://answers.polldaddy.com/poll/2219332/">What do you do when you're too tired to run the floor in a basketball game, but can't leave?</a><span style="font-size:9px;">(<a href="http://answers.polldaddy.com">opinion</a>)</span>
This question is a nearly infallible test of a person's character. Except for those annoying bastards who are in such good shape that they never get tired of running. You never can tell what those guys will do.
Read the comments on this post...A puppet commenter informs me that El Naschie is suing Nature. El Naschie, you may remember, was the journal editor of Chaos, Solitons and Fractals who was accused of not reviewing his own papers in the journal. To be expected, I suppose. But the commenter that pointed this out is entertaining:
Sarah Limbrick [Pontiff: writer of the above linked article about the suit] would surely be interested to know what the leading libel expert in England had to say about the Nature article complained of. He said he is in a state of disbelief that the worlds most respectable scientific journal Nature should publish an article which bears all the hallmarks of the tabloid press. Another interesting point is the conspiracy theory linking the plagiarism of El Naschies work published in Scientific American with the Nature article as well as a far worse article published in Die Zeit. Interestingly all of these three publications are owned by Macmillan. I understand from confidential sources that a mega surprise will be released at the trial engulfing highly reputed names some of whom are Nobel laureates.OOooh, Nobel laureates in a libel case and conspiracy theories to boot! That's bigger than the Scopes monkey trial! Read the comments on this post...
This is a photo I borrowed from Dara Norman's blog post over on NSBP's Vector. It is about the star party held over at hhe White House in October of this year. It looks like it was a lot of fun! Er, you may recognize one or two of those people at the telescope...
-cvj
Plot complexity is not, in and of itself, a measure of the quality of a work of communication, but it is surely a measure of something:
You might think it odd that work has already started to upgrade parts of the ATLAS detector, even before the LHC has started running!
As I wrote in a previous post, one of the key operating parameters for the LHC is luminosity, i.e., the beam intensity. The design calls for a luminosity of 1034/cm2/sec; this means that if you look at the beam head-on, it will contain 1034 protons/sec spread over an area of 1 sq. cm. In reality, each beam consists of 2800 bunches each containing ~1011 protons and about 0.03 mm in radius. Two such bunches collide every 25 nanoseconds, i.e., 40 million times/second; this is known as a bunch crossing.
The number of events that we collect is directly proportional to the luminosity. I had also written that the most common type of events that occur have a very large rate. What this implies is that when two proton bunches collide, most of the time we produce these “ordinary” events. For instance, when the luminosity is 1034, at each bunch crossing we get an average of 23 such events; they are easy to recognize, in the sense that they produce mainly low energy particles. So, when we produce an interesting event, say, a pair of top quarks or a Higgs boson, that event will be sitting atop this low-level “fuzz”. This background produces detectable signals in ATLAS, causing higher load on the trigger system and readout electronics, and pattern recognition problems in the reconstruction software. For instance, the (TRT) sub-detector, could have an occupancy rate as high as 10-20%, i.e., a non-negligible fraction of all wires will register hits; pattern recognition will be tough, but manageable. Other tracking sub-detectors are not a problem since they have much finer granulation (see Seth’s post on tracking). In addition, as particles produced in these “ordinary” events travel through the various sub-systems that are made out of silicon, they cause a slow degradation in them; detectors and electronics are made to resist this deterioration, but they do eventually fail.
When the luminosity starts to increase, as the LHC program calls for, these problems become much more serious. For instance, at a luminosity of 1035, each interesting event will be sitting atop a background of about 400 of these “junk” events; there is no way around it. As for the TRT, you can see that it will be useless.
The upgrade is planned in two phases. In the first phase, corresponding to a luminosity of 3*1034, currently scheduled for 2015 (but will depend on well the LHC performs), we plan to insert an extra layer of silicon sensors between the beam-pipe and the first silicon layer; by then, the first layer’s performance will have deteriorated enough to be noticeable. We will also upgrade some other detectors and electronics that sit close to the beam.
In the second phase, corresponding to a luminosity of 1035, all of the sub-systems that are used for tracking charged particles, i.e., the silicon layers and the TRT, will be replaced. All electronic readout for the detector will be replaced with more sophisticated stuff; the trigger logic and associated electronics will undergo a big overhaul, etc. This is scheduled for 2020.
These upgrades take time and effort, hence the early start. Most groups in the US and UK are already steering resources, in the form of people and money, towards the Phase I upgrade, e.g., my colleague, Hal Evans at Indiana University, is looking at some trigger improvements, and other countries are also joining in. We are used to this mode of operation; at one of my previous experiments (at the accelerator at Cornell University), we were simultaneously, analyzing data collected with version II of the experiment, starting to commission version II.5, and doing R\&D on version III!
–Vivek Jain, Indiana University
LHC started overheating when a cooling unit cut out
Time keeps on passing by. Right now, apples, pumpkins, squashes and persimmons are in season, amongst other vegetables and fruits. The fall weather has settled in nicely, and the usual seasonal events are happening. For example, this is the season when postdoctoral applications are made, and I find myself writing a lot of letters of recommendation. In over a month, I will be reading these applications with the rest of the folks here, to make decisions on who we are going to hire.
Imagine my surprise this week when going to a grocery store, they are already setting up their Christmas decorations. Isn’t it a bit early for that? I was hoping that they wold be setting up the `Turkey day’ decorations after Hallowe’en. This is not the only place. To me, it’s too early for the end of year Holiday decorations. Seems to me they want to rush the end of the year. Or maybe they have nothing to be thankful for.
Posted in Random, Santa Barbara
A fundamental tool in any mathematician’s toolkit is that of reductio ad absurdum: showing that a statement is false by assuming first that
is true, and showing that this leads to a logical contradiction. A particulary pure example of reductio ad absurdum occurs when establishing the non-existence of a hypothetically overpowered object or structure
, by showing that
’s powers are “self-defeating”: the very existence of
and its powers can be used (by some clever trick) to construct a counterexample to that power. Perhaps the most well-known example of a self-defeating object comes from the omnipotence paradox in philosophy (“Can an omnipotent being create a rock so heavy that He cannot lift it?”); more generally, a large number of other paradoxes in logic or philosophy can be reinterpreted as a proof that a certain overpowered object or structure does not exist.
In mathematics, perhaps the first example of a self-defeating object one encounters is that of a largest natural number:
Proposition 1 (No largest natural number) There does not exist a natural number
which is larger than all other natural numbers.
Proof: Suppose for contradiction that there was such a largest natural number . Then
is also a natural number which is strictly larger than
, contradicting the hypothesis that
is the largest natural number.
Note the argument does not apply to the extended natural number system in which one adjoins an additional object beyond the natural numbers, because
is defined equal to
. However, the above argument does show that the existence of a largest number is not compatible with the Peano axioms.
This argument, by the way, is perhaps the only mathematical argument I know of which is routinely taught to primary school children by other primary school children, thanks to the schoolyard game of naming the largest number. It is arguably one’s first exposure to a mathematical non-existence result, which seems innocuous at first but can be surprisingly deep, as such results preclude in advance all future attempts to establish existence of that object, no matter how much effort or ingenuity is poured into this task. One sees this in a typical instance of the above schoolyard game; one player tries furiously to cleverly construct some impressively huge number , but no matter how much effort is expended in doing so, the player is defeated by the simple response “… plus one!” (unless, of course,
is infinite, ill-defined, or otherwise not a natural number).
It is not only individual objects (such as natural numbers) which can be self-defeating; structures (such as orderings or enumerations) can also be self-defeating. (In modern set theory, one considers structures to themselves be a kind of object, and so the distinction between the two concepts is often blurred.) Here is one example (related to, but subtly different from, the previous one):
Proposition 2 (The natural numbers cannot be finitely enumerated) The natural numbers
cannot be written as
for any finite collection
of natural numbers.
Proof: Suppose for contradiction that such an enumeration existed. Then consider the number
; this is a natural number, but is larger than (and hence not equal to) any of the natural numbers
, contradicting the hypothesis that
is enumerated by
.
Here it is the enumeration which is self-defeating, rather than any individual natural number such as or
. (For this post, we allow enumerations to contain repetitions.)
The above argument may seem trivial, but a slight modification of it already gives an important result, namely Euclid’s theorem:
Proposition 3 (The primes cannot be finitely enumerated) The prime numbers
cannot be written as
for any finite collection of prime numbers.
Proof: Suppose for contradiction that such an enumeration existed. Then consider the natural number
; this is a natural number larger than
which is not divisible by any of the primes
. But, by the fundamental theorem of arithmetic (or by the method of Infinite descent, which is another classic application of reductio ad absurdum), every natural number larger than
must be divisible by some prime, contradicting the hypothesis that
is enumerated by
.
Remark 1 Continuing the number-theoretic theme, the “dueling conspiracies” arguments in a previous blog post can also be viewed as an instance of this type of “no-self-defeating-object”; for instance, a zero of the Riemann zeta function at
implies that the primes anti-correlate almost completely with the multiplicative function
, which is self-defeating because it also implies complete anti-correlation with
, and the two are incompatible. Thus we see that the prime number theorem (a much stronger version of Proposition 3) also emerges from this type of argument.
In this post I would like to collect several other well-known examples of this type of “no self-defeating object” argument. Each of these is well studied, and probably quite familiar to many of you, but I feel that by collecting them all in one place, the commonality of theme between these arguments becomes more apparent. (For instance, as we shall see, many well-known “paradoxes” in logic and philosophy can be interpreted mathematically as a rigorous “no self-defeating object” argument.)
— 1. Set theory —
Many famous foundational results in set theory come from a “no self-defeating object” argument. (Here, we shall be implicitly be using a standard axiomatic framework for set theory, such as Zermelo-Frankel set theory.) The basic idea here is that any sufficiently overpowered set-theoretic object is capable of encoding a version of the liar paradox (“this sentence is false”, or more generally a statement which can be shown to be logically equivalent to its negation) and thus lead to a contradiction. Consider for instance this variant of Russell’s paradox:
Proposition 4 (No universal set) There does not exist a set which contains all sets (including itself).
Proof: Suppose for contradiction that there existed a universal set which contained all sets. Using the axiom schema of specification, one can then construct the set
of all sets in the universe which did not contain themselves. As is universal,
is contained in
. But then, by definition of
, one sees that
if and only if
, a contradiction.
Remark 2 As a corollary, there also does not exist any set
which contains all other sets (not including itself), because the set
would then be universal.
One can “localise” the above argument to a smaller domain than the entire universe, leading to the important
Proposition 5 (Cantor’s theorem) Let
be a set. Then the power set
of
cannot be enumerated by
, i.e. one cannot write
for some collection
of subsets of
.
Proof: Suppose for contradiction that there existed a set whose power set
could be enumerated as
for some
. Using the axiom schema of specification, one can then construct the set
The set is an element of the power set
. As
is enumerated by
, we have
for some
. But then by the definition of
, one sees that
if and only if
, a contradiction.
As is well-known, one can adapt Cantor’s argument to the real line, showing that the reals are uncountable:
Proposition 6 (The real numbers cannot be countably enumerated) The real numbers
cannot be written as
for any countable collection
of real numbers.
Proof: Suppose for contradiction that there existed a countable enumeration of by a sequence
of real numbers. Consider the decimal expansion of each of these numbers. Note that, due to the well-known “
” issue, the decimal expansion may be non-unique, but each real number
has at most two decimal expansions. For each
, let
be a digit which is not equal to the
digit of any of the decimal expansions of
; this is always possible because there are ten digits to choose from and at most two decimal expansions of
. (One can avoid any implicit invocation of the axiom of choice here by setting
to be (say) the least digit which is not equal to the
digit of any of the decimal expansions of
.) Then the real number given by the decimal expansion
differs in the
digit from any of the decimal expansions of
for each
, and so is distinct from every
, a contradiction.
Remark 3 One can of course deduce Proposition 6 directly from Proposition 5, by using the decimal representation to embed
into
. But notice how the two arguments have a slightly different (though closely related) basis; the former argument proceeds by encoding the liar paradox, while the second proceeds by exploiting Cantor’s diagonal argument. The two perspectives are indeed a little different: for instance, Cantor’s diagonal argument can also be modified to establish the Arzela-Ascolí theorem, whereas I do not see any obvious way to prove that theorem by encoding the liar paradox.
Remark 4 It is an interesting psychological phenomenon that Proposition 6 is often considered far more unintuitive than any of the other propositions here, despite being in the same family of arguments; most people have no objection to the fact that every effort to finitely enumerate the natural numbers, for instance, is doomed to failure, but for some reason it is much harder to let go of the belief that, at some point, some sufficiently ingenious person will work out a way to countably enumerate the real numbers. I am not exactly sure why this disparity exists, but I suspect it is at least partly due to the fact that the rigorous construction of the real numbers is quite sophisticated and often not presented properly until the advanced undergraduate level. (Or perhaps it is because we do not play the game “enumerate the real numbers” often enough in schoolyards.)
Remark 5 One can also use the diagonal argument to show that any reasonable notion of a “constructible real number” must itself be non-constructive (this is related to the interesting number paradox). Part of the problem is that the question of determining whether a proposed construction of a real number is actually well-defined is a variant of the halting problem, which we will discuss below.
While a genuinely universal set is not possible in standard set theory, one at least has the notion of an ordinal, which contains all the ordinals less than it. (In the discussion below, we assume familiarity with the theory of ordinals.) One can modify the above arguments concerning sets to give analogous results about the ordinals. For instance:
Proposition 7 (Ordinals do not form a set) There does not exist a set that contains all the ordinals.
Proof: Suppose for contradiction that such a set existed. By the axiom schema of specification, one can then find a set which consists precisely of all the ordinals and nothing else. But then
is an ordinal which is not contained in
(by the axiom of foundation, also known as the axiom of regularity), a contradiction.
Remark 6 This proposition(together with the theory of ordinals) can be used to give a quick proof of Zorn’s lemma: see these lecture notes of mine for further discussion. Notice the similarity between this argument and the proof of Proposition 1.
Remark 7 Once one has Zorn’s lemma, one can show that various other classes of mathematical objects do not form sets. Consider for instance the class of all vector spaces. Observe that any chain of (real) vector spaces (ordered by inclusion) has an upper bound (namely the union or limit of these spaces); thus, if the class of all vector spaces was a set, then Zorn’s lemma would imply the existence of a maximal vector space
. But one can simply adjoin an additional element not already in
(e.g.
) to
, and contradict this maximality. (An alternate proof: every object is an element of some vector space, and in particular every set is an element of some vector space. If the class of all vector spaces formed a set, then by the axiom of union, we see that union of all vector spaces is a set also, contradicting Proposition 4.)
One can localise the above argument to smaller cardinalities, for instance:
Proposition 8 (Countable ordinals are uncountable) There does not exist a countable enumeration
of the countable ordinals. (Here we consider finite sets and countably infinite sets to both be countable.)
Proof: Suppose for contradiction that there exists a countable enumeration of the countable ordinals. Then the set
is also a countable ordinal, as is the set
. But
is not equal to any of the
(by the axiom of foundation), a contradiction.
Remark 8 With the axiom of choice, one can show the existence of uncountable ordinals (e.g. by well-ordering the reals), and then there exists a least uncountable ordinal
. By construction, this ordinal consists precisely of all the countable ordinals, but is itself uncountable, much as
consists precisely of all the finite natural numbers, but is itself infinite (Proposition 2). The least uncountable ordinal is notorious, among other things, for providing a host of counterexamples to various intuitively plausible assertions in point set topology, and in particular in showing that the topology of sufficiently uncountable spaces cannot always be adequately explored by countable objects such as sequences.
Remark 9 The existence of the least uncountable ordinal can explain why one cannot contradict Cantor’s theorem on the uncountability of the reals simply by iterating the diagonal argument (or any other algorithm) in an attempt to “exhaust” the reals. From transfinite induction we see that the diagonal argument allows one to assign a different real number to each countable ordinal, but this does not establish countability of the reals, because the set of all countable ordinals is itself uncountable. (This is similar to how one cannot contradict Proposition 4 by iterating the
map, as the set of all finite natural numbers is itself infinite.) In any event, even once one reaches the first uncountable ordinal, one may not yet have completely exhausted the reals; for instance, using the diagonal argument given in the proof of Proposition 6, only the real numbers in the interval
will ever be enumerated by this procedure. (Also, the question of whether all real numbers in
can be enumerated by the iterated diagonal algorithm requires the continuum hypothesis, and even with this hypothesis I am not sure whether the statement is decidable.)
— 2. Logic —
The “no self-defeating object” argument leads to a number of important non-existence results in logic. Again, the basic idea is to show that a sufficiently overpowered logical structure will eventually lead to the existence of a self-contradictory statement, such as the liar paradox. To state examples of this properly, one unfortunately has to invest a fair amount of time in first carefully setting up the language and theory of logic. I will not do so here, and instead use informal English sentences as a proxy for precise logical statements to convey a taste (but not a completely rigorous description) of these logical results here.
The liar paradox itself – the inability to assign a consistent truth value to “this sentence is false” – can be viewed as an argument demonstrating that there is no consistent way to interpret (i.e. assign a truth value to) sentences, when the sentences are (a) allowed to be self-referential, and (b) allowed to invoke the very notion of truth given by this interpretation. One’s first impulse is to say that the difficulty here lies more with (a) than with (b), but there is a clever trick, known as Quining (or indirect self-reference), which allows one to modify the liar paradox to produce a non-self-referential statement to which one still cannot assign a consistent truth value. The idea is to work not with fully formed sentences , which have a single truth value, but instead with predicates
, whose truth value depends on a variable
in some range. For instance,
may be “
is thirty-two characters long.”, and the range of
may be the set of strings (i.e. finite sequences of characters); then for every string
, the statement
(formed by replacing every appearance of
in
with
) is either true or false. For instance,
is true, but
is false. Crucially, predicates are themselves strings, and can thus be fed into themselves as input; for instance,
is false. If however
is the predicate “
is sixty-five characters long.”, observe that
is true.
Now consider the Quine predicate given by
“ is a predicate whose range is the set of strings, and
is false.”
whose range is the set of strings. Thus, for any string ,
is the sentence
“ is a predicate whose range is the set of strings, and
is false.”
This predicate is defined non-recursively, but the sentence captures the essence of the liar paradox: it is true if and only if it is false. This shows that there is no consistent way to interpret sentences in which the sentences are allowed to come from predicates, are allowed to use the concept of a string, and also allowed to use the concept of truth as given by that interpretation.
Note that the proof of Proposition 4 is basically the set-theoretic analogue of the above argument, with the connection being that one can identify a predicate with the set
.
By making one small modification to the above argument – replacing the notion of truth with the related notion of provability – one obtains the celebrated Gödel’s (second) incompleteness theorem:
Theorem 9 (Gödel’s incompleteness theorem) (Informal statement) No consistent logical system which has the notion of a string, can provide a proof of its own logical consistency. (Note that a proof can be viewed as a certain type of string.)
Remark 10 Because one can encode strings in numerical form (e.g. using the ASCII code), it is also true (informally speaking) that no consistent logical system which has the notion of a natural number, can provide a proof of its own logical consistency.
Proof: (Informal sketch only) Suppose for contradiction that one had a consistent logical system inside of which its consistency could be proven. Now let be the predicate given by
“ is a predicate whose range is the set of strings, and
is not provable”
and whose range is the set of strings. Define the Gödel sentence to be the string
. Then
is logically equivalent to the assertion “
is not provable”. Thus, if
were false, then
would be provable, which (by the consistency of the system) implies that
is true, a contradiction; thus,
must be true. Because the system is provably consistent, the above argument can be placed inside the system itself, to prove inside that system that
must be true; thus
is provable and
is then false, a contradiction. (It becomes quite necessary to carefully distinguish the notions of truth and provability (both inside a system and externally to that system) in order to get this argument straight!)
Remark 11 It is not hard to show that a consistent logical system cannot disprove its own consistency either; thus the consistency of such a system is undecidable within that system. Thus this theorem strengthens the (more well known) first Gödel incompleteness theory, which asserts the existence of undecidable statements inside a consistent logical system which contains the concept of a string (or a natural number). On the other hand, the incompleteness theorem does not preclude the possibility that the consistency of a weak theory could be proven in a strictly stronger theory (e.g. the consistency of Peano arithmetic is provable in Zermelo-Frankel set theory).
Remark 12 One can use the incompleteness theorem to establish the undecidability of other overpowered problems. For instance, Matiyasevich’s theorem demonstrates that the problem of determining the solvability of a system of Diophantine equations is, in general, undecidable, because one can encode statements such as the consistency of set theory inside such a system.
— 3. Computability —
One can adapt these arguments in logic to analogous arguments in the theory of computation; the basic idea here is to show that a sufficiently overpowered computer program cannot exist, by feeding the source code for that program into the program itself (or some slight modification thereof) to create a contradiction. As with logic, a properly rigorous formalisation of the theory of computation would require a fair amount of preliminary machinery, for instance to define the concept of Turing machine (or some other universal computer), and so I will once again use informal English sentences as an informal substitute for a precise programming language.
A fundamental “no self-defeating object” argument in the subject, analogous to the other liar paradox type arguments encountered previously, is the Turing halting theorem:
Theorem 10 (Turing halting theorem) There does not exist a program
which takes a string
as input, and determines in finite time whether
is a program (with no input) that halts in finite time.
Proof: Suppose for contradiction that such a program existed. Then one could easily modify
to create a variant program
, which takes a string
as input, and halts if and only if
is a program (with
itself as input) that does not halts in finite time. Indeed, all
has to do is call
with the string
, defined as the program (with no input) formed by declaring
to be the input string for the program
. If
determines that
does not halt, then
halts; otherwise, if
determines that
does halt, then
performs an infinite loop and does not halt. Then observe that
will halt if and only if it does not halt, a contradiction.
Remark 13 As one can imagine from the proofs, this result is closely related to, but not quite identical with, the Gödel incompleteness theorem. That latter theorem implies that the question of whether a given program halts or not in general is undecidable (consider a program designed to search for proofs of the inconsistency of set theory). By contrast, the halting theorem (roughly speaking) shows that this question is uncomputable (i.e. there is no algorithm to decide halting in general) rather than undecidable (i.e. there are programs whose halting can neither be proven nor disproven).
On the other hand, the halting theorem can be used to establish the incompleteness theorem. Indeed, suppose that all statements in a suitably strong and consistent logical system were either provable or disprovable. Then one could build a program that determined whether an input string
, when run as a program, halts in finite time, simply by searching for all proofs or disproofs of the statement “
halts in finite time”; this program is guaranteed to terminate with a correct answer by hypothesis.
Remark 14 While it is not possible for the halting problem for a given computing language to be computable in that language, it is certainly possible that it is computable in a strictly stronger language. When that is the case, one can then invoke Newcomb’s paradox to argue that the weaker language does not have unlimited “free will” in some sense.
Remark 15 In the language of recursion theory, the halting theorem asserts that the set of programs that do not halt is not a decidable set (or a recursive set). In fact, one can make the slightly stronger assertion that the set of programs that do not halt is not even a semi-decidable set (or a recursively enumerable set), i.e. there is no algorithm which takes a program as input and halts in finite time if and only if the input program does not halt. This is because the complementary set of programs that do halt is clearly semi-decidable (one simply runs the program until it halts, running forever if it does not), and so if the set of programs that do not halt is also semi-decidable, then it is decidable (by running both algorithms in parallel).
Remark 16 One can use the halting theorem to exclude overly general theories for certain types of mathematical objects. For instance, one cannot hope to find an algorithm to determine the existence of smooth solutions to arbitrary nonlinear partial differential equations, because it is possible to simulate a Turing machine using the laws of classical physics, which in turn can be modeled using (a moderately complicated system of) nonlinear PDE. Instead, progress in nonlinear PDE has instead proceeded by focusing on much more specific classes of such PDE (e.g. elliptic PDE, parabolic PDE, hyperbolic PDE, gauge theories, etc.).
One can place the halting theorem in a more “quantitative” form. Call a function computable if there exists a computer program which, when given a natural number
as input, returns
as output in finite time. Define the Busy Beaver function
by setting
to equal the largest output of any program of at most
characters in length (and taking no input), which halts in finite time. Note that there are only finitely many such programs for any given
, so
is well-defined. On the other hand, it is uncomputable, even to upper bound:
Proposition 11 There does not exist a computable function
such that one has
for all
.
Proof: Suppose for contradiction that there existed a computable function such that
for all
. We can use this to contradict the halting theorem, as follows. First observe that once the Busy Beaver function can be upper bounded by a computable function, then for any
, the run time of any halting program of length at most
can also be upper bounded by a computable function. This is because if a program of length
halts in finite time, then a trivial modification of that program (of length larger than
, but by a computable factor) is capable of outputting the run time of that program (by keeping track of a suitable “clock” variable, for instance). Applying the upper bound for Busy Beaver to that increased length, one obtains the bound on run time.
Now, to determine whether a given program halts in finite time or not, one simply simulates (runs) that program for time up to the computable upper bound of the possible running time of
, given by the length of
. If the program has not halted by then, then it never will. This provides a program
obeying the hypotheses of the halting theorem, a contradiction.
Remark 17 A variant of the argument shows that
grows faster than any computable function: thus if
is computable, then
for all sufficiently large
. We leave the proof of this result as an exercise to the reader.
Remark 18 Sadly, the most important unsolved problem in complexity theory, namely the
problem, does not seem to be susceptible to the no-self-defeating-object argument, basically because such arguments tend to be relativisable whereas the
problem is not; see this earlier blog post for more discussion. On the other hand, one has the curious feature that many proposed proofs that
appear to be self-defeating; this is most strikingly captured in the celebrated work of Razborov and Rudich, who showed (very roughly speaking) that any sufficiently “natural” proof that
could be used to disprove the existence of an object closely related to the belief that
, namely the existence of pseudorandom number generators. (I am told, though, that diagonalisation arguments can be used to prove some other inclusions or non-inclusions in complexity theory that are not subject to the relativisation barrier, though I do not know the details.)
— 4. Game theory —
Another basic example of the no-self-defeating-objects argument arises from game theory, namely the strategy stealing argument. Consider for instance a generalised version of naughts and crosses (tic-tac-toe), in which two players take turns placing naughts and crosses on some game board (not necessarily square, and not necessarily two-dimensional), with the naughts player going first, until a certain pattern of all naughts or all crosses is obtained, with the naughts player winning if the pattern is all naughts, and the crosses player winning if the pattern is all crosses. (If all positions are filled without either pattern occurring, the game is a draw.) We assume that the winning patterns for the cross player are exactly the same as the winning patterns for the naughts player (but with naughts replaced by crosses, of course).
Proposition 12 In any generalised version of naughts and crosses, there is no strategy for the second player (i.e. the crosses player) which is guaranteed to ensure victory.
Proof: Suppose for contradiction that the second player had such a winning strategy . The first player can then steal that strategy by placing a naught arbitrarily on the board, and then pretending to be the second player and using
accordingly. Note that occasionally, the
strategy will compel the naughts player to place a naught on the square that he or she has already occupied, but in such cases the naughts player may simply place the naught somewhere else instead. (It is not possible that the naughts player would run out of places, thus forcing a draw, because this would imply that
could lead to a draw as well, a contradiction.) If we denote this stolen strategy by
, then
guarantees a win for the naughts player; playing the
strategy for the naughts player against the
strategy for the crosses player, we obtain a contradiction.
Remark 19 The key point here is that in naughts and crosses games, it is possible to play a harmless move – a move which gives up the turn of play, but does not actually decrease one’s chance of winning. In games such as chess, there does not appear to be any analogue of the harmless move, and so it is not known whether black actually has a strategy guaranteed to win or not in chess, though it is suspected that this is not the case.
Remark 20 The Hales-Jewett theorem shows that for any fixed board length, an
-dimensional game of naughts and crosses is unable to end in a draw if
is sufficiently large. An induction argument shows that for any two-player game that terminates in bounded time in which draws are impossible, one player must have a guaranteed winning strategy; by the above proposition, this strategy must be a win for the naughts player. Note, however, that Proposition 12 provides no information as to how to locate this winning strategy, other than that this strategy belongs to the naughts player. Nevertheless, this gives a second example in which the no-self-defeating-object argument can be used to ensure the existence of some object, rather than the non-existence of an object. (The first example was the prime number theorem, discussed earlier.)
The strategy-stealing argument can be applied to real-world economics and finance, though as with any other application of mathematics to the real world, one has to be careful as to the implicit assumptions one is making about reality and how it conforms to one’s mathematical model when doing so. For instance, one can argue that in any market or other economic system in which the net amount of money is approximately constant, it is not possible to locate a universal trading strategy which is guaranteed to make money for the user of that strategy, since if everyone applied that strategy then the net amount of money in the system would increase, a contradiction. Note however that there are many loopholes here; it may be that the strategy is difficult to copy, or relies on exploiting some other group of participants who are unaware or unable to use the strategy, and would then lose money (though in such a case, the strategy is not truly universal as it would stop working once enough people used it). Unfortunately, there can be strong psychological factors that can cause people to override the conclusions of such strategy-stealing arguments with their own rationalisations, as can be seen, for instance, in the perennial popularity of pyramid schemes, or to a lesser extent, market bubbles (though one has to be careful about applying the strategy-stealing argument in the latter case, since it is possible to have net wealth creation through external factors such as advances in technology).
Note also that the strategy-stealing argument also limits the universal predictive power of technical analysis to provide predictions other than that the prices obey a martingale, though again there are loopholes in the case of markets that are illiquid or highly volatile.
— 5. Physics —
In a similar vein, one can try to adapt the no-self-defeating-object argument from mathematics to physics, but again one has to be much more careful with various physical and metaphysical assumptions that may be implicit in one’s argument. For instance, one can argue that under the laws of special relativity, it is not possible to construct a totally immovable object. The argument would be that if one could construct an immovable object in one inertial reference frame, then by the principle of relativity it should be possible to construct an object
which is immovable in another inertial reference frame which is moving with respect to the first; setting the two on a collision course, we obtain the classic contradiction between an irresistible force and an immovable object. Note however that there are several loopholes here which allow one to avoid contradiction; for instance, the two objects
could simply pass through each other without interacting.
In a somewhat similar vein, using the laws of special relativity one can argue that it is not possible to systematically generate and detect tachyon particles – particles traveling faster than the speed of light – because these could be used to transmit localised information faster than the speed of light, and then (by the principle of relativity) to send localised information back into the past, from one location to a distant one. Setting up a second tachyon beam to reflect this information back to the original location, one could then send localised information back to one’s own past (rather than to the past of an observer at a distant location), allowing one to set up a classic grandfather paradox. However, as before, there are a large number of loopholes in this argument which could let one avoid contradiction; for instance, if the apparatus needed to set up the tachyon beam may be larger than the distance the beam travels (as is for instance the case in Mexican wave-type tachyon beams) then there is no causality paradox; another loophole is if the tachyon beam is not fully localised, but propagates in spacetime in a manner to interfere with the second tachyon beam. A third loophole occurs if the universe exhibits quantum behaviour (in particular, the ability to exist in entangled states) instead of non-quantum behaviour, which allows for such superluminal mechanisms as wave function collapse to occur without any threat to causality or the principle of relativity. A fourth loophole occurs if the effects of relativistic gravity (i.e. general relativity) become significant. Nevertheless, the paradoxical effect of time travel is so strong that this physical argument is a fairly convincing way to rule out many commonly imagined types of faster-than-light travel or communication (and we have a number of other arguments too that exclude more modes of faster-than-light behaviour, though this is an entire blog post topic in its own right).
Posted in expository, math.LO Tagged: cantor's theorem, godel incompleteness theorem, halting problem
It can’t be easy being the guy who has to introduce Albert Einstein. But it helps if you’re George Bernard Shaw.
You have to love YouTube, although this is only an excerpt from a somewhat longer speech. Most of the text is here.
Alright, it’s time to start wrapping things up a bit. I’ve been going on for some time now about how symmetries play a central role in our understanding of physics. Here’s a lightning review:
Here’s a summary in graphical form:

If you wanted a nice summary in the format of a nice TED talk, (I know Mike A. is a fan), then I recommend Marcus du Sautoy’s talk earlier this year:
[There is a video that cannot be displayed in this feed. Visit the blog entry to see the video.]
Now I’d like to go over some more formal results with far-reaching effects in physics, i.e. some “advanced topics.” These are usually things which are derived rigorously in successively more advanced physics courses, but here we’ll just give heuristic explanations that highlight the physical relevance. Though the topics are somewhat high brow in their nature, they address very simple questions that I think should be very accessible.
Where do conservation laws come from?
Emmy Noether was a prominent physicist and mathematician in the early 1900s when those fields were dramatically dominated by men. Today every undergraduate physics student learns Noether’s Theorem as part of analytic mechanics. The theorem can be summarized as this:
For every continuous symmetry, there is a conserved charge.
What does this mean? The first part refers to a continuous symmetry. These are like the spacetime symmetries that we discussed in part 1: rotations, translations and their relativistic generalizations (Lorentz transformations). The word continuous means that you can perform the symmetry by any arbitrary amount, as opposed to discrete symmetries (such as those in part 2).
The second part says that if you have a continuous symmetry, then you have a conserved quantity which we call charge. This is something you’re already familiar with: we know that electrons carry electric charge and that this charge is conserved: it is neither created nor destroyed, and every interaction between particles must have the same charge going out as it did going in. For example, if ten physicists entered a bar and only nine left by closing time, then the number of physicists is not conserved. (Maybe one of them had a change of heart and became a mathematician.)
This is really neat, because now we can explain the existence of conserved charges in terms of the existence of a symmetry in nature. Here are a few well known examples from non-relativistic classical physics:
(I once convinced myself that if you think about this for a while, it makes sense ‘intuitively’ without any mathematics. However, this depends on what you mean by ‘intuitive.’) This is now really useful because physicists building theories can generate conserved charges just by imposing that the theory obeys some symmetry.
Where do forces come from?
There’s a particular kind of symmetry that particle physicists are particularly interested in: gauge symmetries. These symmetries are special because they give us gauge bosons, otherwise known as force particles. Thus we can also say that symmetry is the reason why we have forces between particles.
There are two complementary ways in which one can view gauge symmetry. The first way is to think of it as an additional feature of a regular global symmetry. For example, we can say that there is an internal symmetry that causes us to have three ‘colors’ of each quark. We can then “promote” this symmetry to a gauge symmetry and this forces us to introduce a new force particle (the gluon) which connects particles which have color charge. (This color charge, of course, comes from Noether’s theorem above.) This works by saying that we are promoting a global symmetry into a local symmetry, meaning that we allow ourselves to do different symmetry transformations at different points in space(time).
For example, electric charge comes from a simple symmetry called U(1). This is the symmetry of a circle. So we can imagine a little circle living at every point in spacetime, and at every point there is some point on the circle which tells a property of the electron field. What is this property? It’s a complex phase, basically a complex number of length 1 that multiplies the electron variable in our theories. This isn’t a “real” circle since it’s an internal symmetry, we’re just imagining this to help us understand what’s going on. A global symmetry would tell us that if we move the point on the circle by the same amount everywhere in space, then the laws of physics should be the same:

A gauge symmetry, however, is one where we can do an independent rotation at each point in space (i.e. a local transformation):

If a symmetry is a gauge symmetry, then this transformation should still leave the laws of physics the same. Note that the local symmetry is much more restrictive than the global symmetry. Why does this give us force particles? If all of the arrows are mutually misaligned, it turns out that we need to define a way to go from the orientation of an arrow at point x to the orientation of the arrow at point y. In differential geometry this is called a connection. In physics it’s called a gauge field. This “gauge field” object which tells us how the arrows point is precisely what we call force particles! So the photon is just the “gauge field” which connects the U(1) orientations of electrically charged particles. Particles that aren’t charged don’t have a U(1) orientation, and so the photon doesn’t “see” them.
As a side note, the second way to think of this is to say that if the arrows can point in any direction then this piece of information is actually a redundancy in our theory. In other words, gauge symmetries can be thought of as not arising from some natural principle, but as a human-made invention that introduces redundancy to makes nature more mathematically tractable. If you think about this for a while this becomes something of an ontological question, which is not what we want to get into; but the point is that quantum effects tend to break lots of symmetries. If we create a gauge symmetry that is redundant, we ought to make sure that quantum effects don’t mess it up or else our theory in completely inconsistent. The quantum effects are called anomalies, and this leads to a powerful tool in theoretical physics called anomaly cancellation. (It was a cancellation of anomalies in 1984 by Green and Schwarz that led to the first superstring revolution.)
Where do massless particles come from?
These are a weird questions Why should I ask where massless particles come from? Why is this different from asking where any particle comes from?
Massless particles are somewhat special in quantum field theory. It turns out that they can lead to apparent theoretical inconsistencies (so-called “infrared divergences”) and quantum mechanics usually conspires to give particles mass unless (ta-da) such masses are protected against by some symmetry. (An example of this is chiral symmetry, i.e. the left-handed and right-handed structure of the Standard Model fermions, that prevents fermions from getting mass other than by the Higgs mechanism. If that didn’t make sense, then don’t worry about it.)
This is related to Goldstone’s theorem, which tells us that:
If a continuous global symmetry is broken, then there are massless particles in the theory.
By “continuous global symmetry” we mean something like the U(1) symmetry of the circle described above or its higher-dimensional generalizations. I’ll have to be necessarily sketchy here, but the picture that everyone always draws is the so-called “Mexican hat potential”:

The idea is that we can imagine a quantum field as a marble rolling around in that Mexican-hat-shaped bowl. The position of the marble represents the “vacuum expectation value” of the field. Here ‘vacuum expectation value‘ (vev) is code for whether or not the particle exists everywhere in space without any additional quantum effects. Most potentials don’t have that dimple in the middle, so the ‘marble’ wants to sit right in the center where the field takes zero vacuum expectation value. This is what most particles do. For the Higgs, however the vacuum expectation value is nonzero — the Mexican-hat shaped bowl pushes the Higgs to live on the ring where the hat is flat. This means that the Higgs “exists everywhere” in space even without having to be created through quantum interactions. Indeed, it is the interaction of particles with this “omnipresent” Higgs field that gives those particles mass.
Now, Goldstone’s theorem is the observation that the shape of the bowl actually corresponds to the mass of the particle rolling around in it. This is part of the structure of all quantum field theories, and you can take it as a stated fact. The key observation is that the Higgs particle still has a direction where it is free to roll around, namely the angular direction. Goldstone tells us that these directions (in “field space,” i.e. in the space of possible Higgs vacuum values, not physical space) correspond to massless excitations of the Higgs field.
So you can think about it this way: the fact that the Higgs field doesn’t live at the center of the potential (it’s pushed out by the dimple) tells us that the Higgs lives everywhere in space. Particles interactions with this “everywhere in space” field gives them mass. In addition, however, there is a certain way that we can jiggle this “everywhere in space” field that corresponds to massless particles. This is just like the ripples one gets from throwing a rock into a pond.
Where do massive force particles come from?
This massless Higgs field coming form Goldstone’s theorem relates to a different problem in field theory related to gauge bosons, i.e. force particles. It turns out that if we write a theory with massive force particles (as we observe with the W and Z bosons), this usually leads to apparent theoretical inconsistencies (a “breakdown of unitarity”), e.g. probabilities that are greater than 100%. However, it turns out that if the force particles get their masses in a very specific way, i.e. via a process called spontaneous symmetry breaking, then everything is okay. Further, it turns out that everything is okay because—you guess it—there is symmetry in the theory.
The point is this: the W and Z bosons get their masses from “eating” the massless Higgs particles. What this means is that the physical W and Z bosons actually contain part of the Higgs field! More technically, the W and Z fields mix quantum mechanically with the massless Higgs particles. This becomes a rather technical point, so I’ll leave it at that. Don’t worry if you’re unhappy with the word “eat” in this context. It is commonly used even in quantum field theory textbooks and bugged me for a long time until I felt like I properly understood what was going on.
Summary and Outlook
I’ve been going on for quite some time now about symmetries and how they’re important in particle physics. Let me close with the following summary:
In future posts I’ll spend more time talking about my research, in which the theoretical physics community has been using symmetry in novel ways to go beyond the Standard Model. To whet your appetite, this includes extending spacetime symmetry (extra dimensions), imposing a matter particle/force particle symmetry (supersymmetry), and incorporating all of the Standard Model gauge symmetries into a single symmetry (grand unification).
Today's Quantum Optics lecture is about quantum computing experiments, and how different types of systems stack up. Quantum computing, as you probably know if you're reading this blog, is based on building a computer whose "bits" can not only take on "0" and "1" states, but arbitrary superpositions of "0" and "1". Such a computer would be able to out-perform any classical computer on certain types of problems, and would open the exciting possibility of a windows installation that is both working and hung up at the same time.
There are roughly as many types of proposed quantum computers as there are people working on quantum computation. It's not clear which of them, if any, will eventually prove to be useful, meaning that this is the perfect subject for a blog poll:
<a href="http://answers.polldaddy.com/poll/2217973/">The quantum computers of the future will be based on:</a><span style="font-size:9px;">(<a href="http://www.polldaddy.com">survey</a>)</span>
While this is a poll about quantum computing, the machines running the poll are strictly classical, so you can only choose one option.
Read the comments on this post...The Rado graph, or random graph, seems to be an extraordinary entity. Take countably many nodes, then for each pair of nodes flip a coin and if it shows heads, draw an edge between them. Almost surely you will have generated the Rado graph, .
Any finite graph (and indeed any countable graph) is contained in , not just in the sense of being embeddedable, but in the sense of being an induced subgraph, that is, it is the full subgraph on a subset of nodes. Along with this universality, is also homogenous in the sense that any isomorphism between finite induced subgraphs extends to an automorphism of all of .
is very robust, you can delete finitely many vertices, add or remove finitely many edges, or interchange edges and non-edges, and you still end up with a graph isomorphic to . Furthermore, the odd thing about the construction of this graph is that I didn’t have to tell you the probability of the coin showing heads. So long as is in and the tosses are independent, the Rado graph almost surely emerges.
Not only this, there are many ways to generate it without using random devices. Rado himself took the nodes to be the natural numbers, and an edge between and whenever either the th bit of the binary representation of is nonzero, or the th bit of the binary representation of is nonzero. Yet another method has us take as nodes the prime numbers equal to 1 mod 4. Then we join and if they are quadratic residues of each other.
I find myself interested in because it is an example of a Fraïssé limit. This is nicely explained in Artem Chernikov’s slides – What is…Fraïssé construction? We want a countable category, , of finitely generated structures (for a countable signature) whose morphisms are embeddings, which satisfies the three properties:
So now some questions: Can this kind of limit be given a natural category theoretic description? Is it related to any of the completions we have listed here? It seems to be a kind of Ind-completion, pretty close to an ideal completion. Can we understand the homogeneity category theoretically?
I should imagine I need to get a look at this thesis – Trevor Irwin, Fraïssé limits and colimits with applications to continua:
The classical Fraïssé construction is a method of taking a direct limit of a family of finite models of a language provided the family fulfills certain amalgamation conditions. The limit is a countable model of the same language which can be characterized by its (injective) homogeneity and universality with respect to the initial family of models. A standard example is the family of finite linear orders for which the Fraïssé limit is the rational numbers with the usual ordering.
We present this classical construction via category theory, and within this context we introduce the dual construction. This respectively constitutes the Fraïssé colimits and limits indicated in the title. We provide several examples.
We then present the projective Fraïssé limit as a special case of the dual construction, and as such it is the categorical dual to the classical (injective) Fraïssé limit. In this dualization we use a notion of model theoretic structure which has a topological ingredient. This results in the countable limit structures being replaced by structures which are zero-dimensional, compact, second countable spaces with the property that the relations are closed and the functions are continuous.
We apply the theory of projective Fraïssé limits to the pseudo-arc by first representing the pseudo-arc as a natural quotient of a projective Fraïssé limit. Using this representation we derive topological properties of the pseudo-arc as consequences of the properties of projective Fraïssé limits. We thereby obtain a new proof of Mioduszewski’s result that the pseudo-arc is surjectively universal among chainable continua, and also a homogeneity theorem for the pseudo-arc which is a strengthening of a result due to Lewis and Smith. We also find a new characterization of the pseudo-arc via the homogeneity property.
We continue with further applications of these methods to a class of continua known as pseudo-solenoids, and achieve analogous results for the universal pseudo-solenoid.
Available treatments include Wieslaw Kubiś, Fraisse sequences - a category-theoretic approach to universal homogeneous structures:
We present a category-theoretic approach to universal homogeneous objects, with applications in the theory of Banach spaces and in set-theoretic topology,
and Olivia Caramello, Fraïssé’s construction from a topos-theoretic perspective:
We present a topos-theoretic interpretation of (a categorical generalization of) Fraisse’s construction in model theory, with applications to countably categorical theories.
Hosting for the resulting dataset will be provided by Amazon in S3, and freely available to all EC2 users.
In addition, the code used to create and process the dataset will be available for download”
Click here for all of my del.icio.us bookmarks.
Well, that was not altogether terrible, I gather, but I was so tired and a little bit out of sorts* when I started giving the talk that I was running very slowly. So by time I got to the end (even though I hit my right pace later on) I was some 15 minutes over. Ugh. People seemed to like it, but I definitely need to whittle out a few of the more superfluous slides (which I'd always intended to) and strike the intended pace I wanted to hit earlier than 3/4 of the way through the talk. My reward (besides some nice remarks at the end of the talk that sort of made my day) was a visit to another excellent restaurant for dinner, with some good company. The extraordinarily good Vij's. The hour's wait for the food was perfectly fine given the nibbles they bring you while you wait, and the food itself once you get it. Here's a shot or two of some of the excellent food just after its arrival, and the empty dishes and satisfied hosts just after polishing it all off: [...]I don’t really know what to think of this, but apparently there’s a news story going around that a piece of bread found its way onto an electrical connection in one of the LHC’s above-ground buildings leading to a failure of a cooling unit. The buildings are protected by high security fences so the story is that it was probably dropped by a bird. The upside is that they don’t expect this to make a dent in the current LHC start up schedule.
[As someone who has yet to visit CERN, I have to wonder how such a think could actually occur. Is it so easy for a bird to get into one of these buildings? Should someone make silly references to Spiderman 3?]
I did my best to try to track this story down to a reputable source, hopefully the US/LHC bloggers currently at CERN might be able to comment? (Mike? Seth? See any birds lately?
)
I first saw this story reported on the social news site Reddit, where it gathered plenty of rather funny comments, e.g. “So… what IS the average flight velocity of a baguette-laden swallow?” Reddit linked to a report in Popular Science, which in turn linked to The Register which claimed as its source a press briefing by Mike Lamont, the LHC Machine Coordinator. At the time of this writing, most of the Google News references to this incident point to the article at The Register, though some spiced it up with some of their own illustrations, e.g. the Crunch Gear blog.
Knowing The Register’s history of making tongue-in-cheek statements, I dug a little bit deeper and I was finally able to track down a story from United Press International under their “Odd News” section with some details. This story finally references The Times of London, which has a nice write up by Nico Hines. It sounds like the fail-safes kicked in properly and it is expected that the LHC can stick to its scheduled restart (keeping in mind that this schedule gets regular minor updates).
So, on the one hand, it sounds like the story is legit. On the other hand, I maybe I should feel guilty that by posting this since I’m playing the role of “yet another blogger posting on a breaking news item and contributing nothing new other than a few snarky comments.” (What does this say about the state of blog-based journalism these days?)
[Hey, at least I provided a paper trail of references like an honest academic.]

DNA is a perfect raw material for constructing nanoscale structures. Since base-pairing has been selected by evolution to be highly specific, it is easy to design sequences that will link up with their proper mates. In this way, we can treat small pieces of DNA like Tinkertoys, designing individual components and then allowing them to assemble when we put them together. In addition, the chemistry of DNA synthesis has been completely automated, so custom pieces of DNA can be easily constructed, or even ordered from commercial biotech companies. This puts DNA nanotechnology in the hands of any modest laboratory, and many laboratories have taken advantage of this, creating nanoscale scaffolds, tweezers, polyhedra, computers, and even tiny illustrations composed entirely of DNA.
DNA has the characteristic mix of flexibility and rigidity that is the hallmark of biological molecules. If the sequence of bases is correct, it zips up into a double helix that, at least in short lengths, is a sturdy cylinder. Longer stretches, however, start to show flexibility, and the DNA helix curves and bends. The trick in designing a DNA infrastructure is to develop ways to rigidify the overall structure. In most cases, this has been done by having the DNA strand weave back and forth between many parallel double helices. In this way, the bundle of helices form a structure that is far more rigid than a single helix.
Nadrian Seeman pioneered the use of DNA for building nanoscale structures. After decades of work, the structure shown here, from PDB entry 3gbi, is the first crystal structure of a DNA lattice completely designed from scratch. It is built of small 3D triangular subunits, each composed of three separate types of DNA strands. The base sequences are carefully chosen so that they assemble into this one particular structure, and not any others. At the corners of the 3D triangle, there are sticky ends that link to other triangles, stacking up in a predicable way into a three-dimensional scaffold.
Read the rest from David Goodsell at the RCSB PDB here. We’ve covered some of this kind of work before, too.
These LHC posts appear in two places: the main blog site, and the facebook page. The main site supports video, which currently don’t show up in the facebook posts.
Two of my previous posts had video, so check those out on the main site!
Also notice the “View Original Post” link at the bottom of these which takes you to the main site.
A long time ago, a massive star about 10,000 light years from Earth went kaboom.
329 years ago, we think, in 1680, the light from the supernova explosion reached Earth and was recorded as a new star by the Flamsteed, then the Astronomer Royal, looking relatively dim as nearby supernove go, due to the layers of dust in the galaxy between us and the site of the explosion.
Now, digging into archival x-ray data, a couple of astronomers may finally have figured out what is going on in Cass A.
Read the rest of this post... | Read the comments on this post...For this week's Baby Blogging, we have a shot of Kate helping SteelyKid with her new favorite game:

It's called "Take off my shoes, and put them back on." She can play this for hours. It would be even cuter if she could do the putting on and taking off herself, but alas, she's still kind of unclear on the solidity of objects, and doesn't really grasp that her feet can only get into the shoes from the open end. She'll get there, though.
SteelyKid officially moved out of the infant room at day care this week. She's now in the next age/ development group, termed "Waddlers" (an intermediate step between "Infants" and "Toddlers"). I very briefly considered going for the alliterative "Wednesday Waddler Blogging," but regained my senses in time. We're going to stick with "Thursday Baby Blogging" for a while yet.
Read the comments on this post...This week, Henry Towsner continued some model-theoretic preliminaries for the reading seminar of the Hrushovski paper, particularly regarding the behaviour of wide types, leading up to the main model-theoretic theorem (Theorem 3.4 of Hrushovski) which in turn implies the various combinatorial applications (such as Corollary 1.2 of Hrushovski). Henry’s notes can be found here.
A key theme here is the phenomenon that any pair of large sets contained inside a definable set of finite measure (such as ) must intersect if they are sufficiently “generic”; the notion of a wide type is designed, in part, to capture this notion of genericity.
— 1. Global types —
Throughout this post, we begin with a countable structure of a language
, and then consider a universal elementary extension
of
(i.e. one that obeys the saturation and homogeneity properties as discussed in Notes 1. Later on,
will contain the language of groups, and then we will rename
as
to emphasise this.
Recall from Notes 1 that a partial type over a set is a set of formulae (with
variables for some fixed
) using
as constant symbols, which is consistent and contains the theory of
; if the set of formulae is maximal (i.e. complete), then it is a type. One can also think of a type as an ultrafilter over the
-definable sets; if
is a type and
is an
-definable set given by some formula
, then either
lies in
(in which case we write
) or
lies in
(in which case we write
) but not both.
When the set is small (i.e. has cardinality less than that of
, which in particular would be true of
consisted of
union with a finite set, which is a very typical situation), then by saturation one can identify types (or partial types) with the subset
of
that they cut out. In particular, these sets are non-empty. Adding more formulae to a partial type corresponds to shrinking the set that they cut out, and vice versa.
However, if we have a global type – a type defined over the entire model
– then one can no longer identify types with the set
that they cut out, because these sets are usually empty! However, what we can do is restrict
to some smaller set
of constants to create a type
over
, defined as the set of all formulae in
that only involve the constants in
. It is easy to see that this is still a type, and if
is small, it cuts out a non-empty set in
.
A global type is said to be
-invariant, or invariant for short, if the set of formulae in
is invariant under any automorphism
of
that fixes
. In particular, given any
-definable set
and
, we see that
if and only if
, where
is a slice of
. (Indeed, this gives an equivalent definition of invariance.)
A trivial example of an invariant global type would be the type of an element (or in
). This cuts out a singleton set
. This is in fact the only invariant global type that cuts out anything at all:
Lemma 1 Let
be a global invariant type. Then
is realisable in
(i.e.
is non-empty) if and only if it is realisable in
(and is the type of a single element in
).
Proof: Suppose is realisable in
by some
. Since the formula
is definable in
, we see that
, i.e.
cuts out precisely the singleton set
. As
is invariant,
must then be invariant under all
-fixing automorphisms of
. By homogeneity, this means that there is no element distinct from
which is elementarily indistinguishable from
over
; in other words,
is the set cut out by the type
of
over
.
By saturation, the formula together with the formulae in
is not satisfiable, hence not finitely satisfiable. Thus there is a finite set of formulae in
that cut out
, i.e.
is definable over
. But as
is an elementary extension of
, these formulae must also be realisable in
, i.e.
lies in
, and the claim follows.
(Because of this, one should regard the notation carefully; the set
that
cuts out in the model
may in fact be empty, but when we write
for some definable
, we interpret this syntactically rather than semantically (or equivalently, that
holds in all extensions
of
, and not just in
itself.)
On the other hand, invariant global types exist in abundance:
Lemma 2 Let
be a type over
. Then there exists an invariant global type
that refines
(i.e. it contains all the formulae that
does).
Proof: We view as a collection of logically consistent formulae. We enlarge this collection to a larger one
by adding in the negations of all the formulae definable over
that are not realisable in
. Observe that this collection remains logically consistent, because any finite set of formulae in
were realisable in
, hence in
(which is an elementary substructure). Hence, by Zorn’s lemma, one can extend
to a global type
.
We now claim that is invariant. Indeed, let
be a sentence over
that is contained in
, and let
be an automorphism that fixes
. If
is not in
, then
must be in
(by completeness), and hence
is in
also, and hence must be realisable in
(otherwise its negation would be in
, and hence in
). But this is absurd since
fixes
. Thus
does lie in
, yielding invariance.
A major use of invariant global types for us will be that they can be used to generate sequences of indiscernibles (as defined in previous notes):
Lemma 3 Let
be a global invariant type of some arity
, and construct recursively a sequence
by requiring
for all
. (This is always possible since types are satisfiable once restricted to small sets, by saturation, as discussed earlier). Then
are indiscernible over
, i.e. the tuples
for
are elementarily indistinguishable (over
) for any fixed
.
Proof: This is achieved by an induction on . The
case is clear since the
all have type
over
. Now we do the
case. It suffices to show that
and
are elementarily indistinguishable over
for all
.
By construction, and
have the same type over
, and so
and
are elementarily indistinguishable over
. So it remains to show that
and
are elementarily indistinguishable.
Let be an
-definable relation that contains
; we need to show that
contains
also.
Since and
have the same type over
, by homogeneity there exists an automorphism
of
fixing
that maps
to
. Since
realises
, we see that
contains the sentence
, hence by invariance
contains
also. Since
realises
, we conclude
, as required.
This concludes the case. The higher
case is similar and is left as an exercise.
— 2. Intersections of wide types —
Now we assume that the structure is equipped with an
-invariant Kiesler measure
. This leads to the notion of a wide type – a type such that all the
-definable sets containing this type have positive measure. Intuitively, elements of a wide type are distributed “generically” in the structure.
In the previous notes we showed that wide types can be “split” amongst indiscernables, as follows:
Lemma 4 Let
be an element or tuple in
, let
be a wide type over
for some set of constants
, and let
be a sequence of indiscernibles (over
) that has the same type as
(over
). Then for any finite number
in this sequence, one can find a type
such that
has the same type over
as
does over
, for all
.
We now use this lemma to show that sets defined by wide types intersect each other in a uniform fashion.
Lemma 5 Let
be types over
, and let
,
be realisations of
such that
and
are wide. Let
,
be
-definable sets with parameters, contained inside an
-definable set
of finite measure; then
if and only if
.
Proof: By homogeneity, there is an automorphism fixing that sends
to
, and maps
to another element of
. Thus without loss of generality we may assume
.
We assume for contradiction that and
.
By Lemma 2, we may extend to an invariant global type
. Observe that for any
, either one has
for all
, or one has
for all
(since there is a
-definable set between
and
. Suppose first that the former option holds for some
, thus there is a uniform lower bound
. We now define a sequence
and an indiscernible sequence
as follows:
Let be the set
, then observe from the above construction that
lies in
and
for all
. On the other hand, we claim that
is uniformly bounded away from zero, this contradicts the finite measure of
by the pigeonhole principle.
To see the uniform lower bound, find an automorphism fixing
that maps
to
. By hypothesis,
, thus there exists a rational
such that the predicate that models
is in
, hence in
. By invariance, the predicate
is in
also, hence by construction of
,
, and the claim follows.
Now we consider the opposite case, in which for all
. Then we run the construction slightly differently: for each
in turn, set
to be a realisation of
, then set
so that
. (This is possible because for any definable set
containing
, the
-definable set
contains
and thus has positive measure, and so the same is true for
; now use saturation.) Then again we see that the
lie in
, have intersection of measure zero, and have measure uniformly bounded from below, and we again obtain a contradiction.
Now we place a group structure on , and obtain a variant of the above result:
Proposition 6 Let
be types in
, with
wide. Suppose that
,
are contained in
-definable sets
such that
has finite measure. Let
be such that the type of
over
is wide. Assume also that the Keisler measure is translation-invariant. Then
is also wide.
Proof: Suppose this is not the case, so that there exists an -definable set
containing
such that
has zero measure. (Initially, one would need two different definable sets containing
, but one can simply take their intersection.) On the other hand, as
is wide,
itself has positive measure. We can place
in
.
By using the fact that wide types over one set of constants can be refined to wide types over larger sets of constants (Lemma 2 from the previous notes), we see that we can recursively construct a sequence with
wide for all
. Since
, we conclude from Lemma 5 that
for all
. On the other hand, the
all have the same measure as
, which is positive. Finally, the
are all contained in
, which has finite measure; this leads to a contradictoin.
This “generic intersection” property of translates of will be important in later arguments when creating near-groups.

As my loyal reader knows, I have no fear when it comes to models with huge numbers of parameters; indeed the ubercalibration project is effectively a fit with hundreds of millions of parameters, and we can prove that we got the global optimum (in the sense that we made sure the problem is guaranteed to be convex). Today Roweis pitched a generalization of all this, in which one creates a very flexible linear model space, where parameters are tied to or in a hierarchy of meta-data, such that some parameters are tied to, say, the date, some to the airmass, some to the seeing, some to the camera column, and so on. Then the model discovers which parameters are necessary for accurate modeling of the data and thereby discovers important meta-data, dependencies of the data on artificial issues, and bad data. We tentatively agreed to run this on the new BOSS spectra from SDSS-III.
Iain Murray gave another nice talk today, this time in the machine learning group, about a high-end sampling method called elliptical slice sampling
, optimized for gaussian process modeling, where calls to the prior probability distribution function are more expensive than likelihood calls. It was a very nice talk and got me thinking about slice sampling in general, which might be very useful to us.
Another one of the fundamental properties of a chaotic system is dense periodic orbits. It's a bit of an odd one: a chaotic system doesn't have to have periodic orbits at all. But if it does, then they have to be dense.
The dense periodic orbit rule is, in many ways, very similar to the sensitivity to initial conditions. But personally, I find it rather more interesting a way of describing key concept. The idea is, when you've got a dense periodic orbit, it's an odd thing. It's a repeating system, which will cycle through the same behavior, over and over again. But when you look at a state of the system, you can't tell which fixed path it's on. In fact, miniscule differences in the position, differences so small that you can't measure them, can put you onto dramatically different paths. There's the similarity with the initial conditions rule: you've got the same basic idea of tiny changes producing dramatic results.
Read the rest of this post... | Read the comments on this post...
- I dump lots of files I want to read on it using an application called filemagnet. Very useful. See here. In this case I'm looking at the pdf output from the keynote presentation software I use. ) I'm making a few notes on changes I'll make when I get back to my hotel. I'm also reviewing a few other documents that were sent to me for response, and tying off various loose ends here and there before the evening ends.
I sometimes worry that it might be considered rude that I'm apparently doing a bit of work and so not fully paying attention to the food, but I'm not too worried, really. Once the actual food arrives I've been quite appreciative of it, giving it my full [...]I have a lab all morning, so I won't get to more substantive blogging before this afternoon. The Yankees won their 27th World Series title last night, though, and given their status as the most polarizing team in baseball, this seems like a good excuse for a poll:
<a href="http://answers.polldaddy.com/poll/2213713/">What do you think about the Yankees winning the World Series?</a><span style="font-size:9px;">(<a href="http://www.polldaddy.com">surveys</a>)</span>
Choose only one.
Read the comments on this post...It was reported last week by Dennis Overbye at the New York Times that the LHC is only going to reach a center-of-mass energy of 2.2 TeV (i.e. an energy of 1.1 TeV per beam) before the winter shutdown. I was asked about this previously, and at the time I thought it was “in a schedule somewhere.” After looking around, though, it’s much less clear to me where the information actually come from — maybe I heard it from people who had read the New York Times blog, and maybe Overbye originally learned it from magical time-traveling Higgs Bosons! So we might have to demote the whole thing to the category of rumor — but it’s a rumor that appeared in the news, so I can certainly say what I’d think about it if it were true.
If indeed the LHC only achieves an energy of 2.2 TeV by the time it’s shut down in mid-December, some might be tempted to characterize it as a serious setback or defeat; in fact, it’s nothing of the kind. Here’s what’s really going on: the LHC is in the middle of an ongoing start-up process, and has to take a quick break in December and January, but will then pick up where it left off. That means that the point the accelerator startup happens to reach before shutdown doesn’t mean anything special at all — the really important thing is where it gets to when the process continues next year.
What we do know from CERN is that there are three stages of LHC magnet commisionning: to 2000, 4000, and 6000 amps of current. We also know that reaching 2000 amps “allow[s] the passage and guidance of beams at about 1.2 TeV” (which sounds close enough to the 1.1 TeV figure to be rounding uncertainty). So if there were only time to commission to 2000 amps before the end of the year, that could certainly explain the limited beam energy.
Is running at 2.2 TeV good for the physics program? Oddly enough, if we only have a few days of running, there are two lower energies that would be more fun for physicists. The best option might be to continue at the energy at which the center-of-mass energy achieved by the previous accelerator stage, 0.9 TeV. This is the energy that the first collisions will happen at, and longer running at a single energy would let those of us who work on the detectors get a better handle on how our data looks. (For example, I could attempt to do an early, quick version of my track jet analysis. In fact, I’ll try to do that no matter what; it will be good practice, if nothing else.) Another option might be to run at exactly 1.96 TeV, which is the energy of the Tevatron accelerator at Fermilab; that would give us a rare chance to look at the differences between proton-proton and proton-antiproton collisions at the same energy.
But the physics program isn’t the top priority this year, it’s getting the LHC fully up and running. Whatever the rumors say, we don’t yet know how far accelerator commissioning will get this year. Even 2.2 TeV would be enough to make the LHC the highest-energy collider in the world, which is an accomplishment to be proud of. No matter what, there will be much more to do next year, and we can start making discoveries! — Seth
It's not getting as much press as the "X Prize" for private rocket launches, but NASA has quietly been running a contest for work toward a "space elevator," offering up to $2 million for a scheme to transmit power to a small robot climbing a 1km cable. Yesterday, the team from LaserMotive, including certified rocket scientist and friend of the blog Jordin Kare, successfully powered a robot up a 900m cable using diode laser arrays to send power to solar panels on the robot. They managed an average speed of 3.73 m/s, which doesn't get them the full $2 million prize, but qualified them for the $900,000 prize for an average speed above 2 m/s.
There are still two other teams in the competition, which will continue today. The Kansas City Space Pirates team got within 50 m of the top, but not fast enough to be in the money, and a team from the University of Saskatchewan has yet to compete at all. LaserMotive can apparently take another crack as well, and over on Twitter, Mary Kay Kare says that Jordin has some ideas of how to speed their climber up, and go for the grand prize.
This is a long, long way from an actual space elevator, of course-- working out the power beaming is arguably less of a challenge than finding a material to build the thing out of, which nobody has come close to doing. Still, it's cool to see, and a reminder that while rockets are flashier, there's work going on on other ways to get stuff into space in an economical manner.
Read the comments on this post...
Brian Cox explains in this 3 minute talk from Feb 2009 what went wrong at CERN last year, and why continuing this work, despite setbacks, is worth it.
It has been over a year since that accident, and many repairs and improvements have been made including:
This has made me more confident, but at the same time, the problem we hit happened just after one week of running – maybe there are problems we would have hit only after a few weeks of running, or a few months?
We don’t know everything that might happen, so I’m glad we are playing it safer this time.
As the previous discussion on displaying mathematics on the web has become quite lengthy, I am opening a fresh post to continue the topic. I’m leaving the previous thread open for those who wish to respond directly to some specific comments in that thread, but otherwise it would be preferable to start afresh on this thread to make it easier to follow the discussion.
It’s not easy to summarise the discussion so far, but the comments have identified several existing formats for displaying (and marking up) mathematics on the web (mathML, jsMath, MathJax, OpenMath), as well as a surprisingly large number of tools for converting mathematics into web friendly formats (e.g. LaTeX2HTML, LaTeXMathML, LaTeX2WP, Windows 7 Math Input, itex2MML, Ritex, Gellmu, mathTeX, WP-LaTeX, TeX4ht, blahtex, plastex, TtH, WebEQ, techexplorer, etc.). Some of the formats are not widely supported by current software, and by current browsers in particular, but it seems that the situation will improve with the next generation of these browsers.
It seems that the tools that already exist are enough to improvise a passable way of displaying mathematics in various formats online, though there are still significant issues with accessibility, browser support, and ease of use. Even if all these issues are resolved, though, I still feel that something is still missing. Currently, if I want to transfer some mathematical content from one location to another (e.g. from a LaTeX file to a blog, or from a wiki to a PDF, or from email to an online document, or whatever), or to input some new piece of mathematics, I have to think about exactly what format I need for the task at hand, and what conversion tool may be needed. In contrast, if one looks at non-mathematical content such as text, links, fonts, non-Latin alphabets, colours, tables, images, or even video, the formats here have been standardised, and one can manipulate this type of content in both online and offline formats more or less seamlessly (in principle, at least – there is still room for improvement), without the need for any particularly advanced technical expertise. It doesn’t look like we’re anywhere near that level currently with regards to mathematical content, though presumably things will improve when a single mathematics presentation standard, such as mathML, becomes universally adopted and supported in browsers, in operating systems, and in other various pieces of auxiliary software.
Anyway, it has been a very interesting and educational discussion for me, and hopefully for others also; I look forward to any further thoughts that readers have on these topics. (Also, feel free to recapitulate some of the points from the previous thread; the discussion has been far too multifaceted for me to attempt a coherent summary by myself.)
Posted in non-technical, question Tagged: html, mathematical formatting, MathML
Jack is looking at Anne, but Anne is looking at George. Jack is married, but George is not. Is a married person looking at an unmarried person?
A) Yes.
B) No.
C) Cannot be determined.
This is from this month’s Scientific American — article unfortunately costs money. It’s about “dysrationalia,” which is what happens when people with nominally high IQ’s end up thinking irrationally. A phenomenon I’m sure we’ve all encountered, especially in certain corners of the blogosphere.
And the answer is the first option. But over 80 percent of people choose the third option. Here’s the solution: the puzzle doesn’t say whether Anne is married or not, but she either is or she isn’t. If Anne is married, she’s looking at George, so the answer is “yes”; if she’s unmarried, Jack is looking at her, so the answer is still “yes.” The underlying reason why smart people get the wrong answer is (according to the article) that they simply don’t take the time to go carefully through all of the possibilities, instead taking the easiest inference. The patience required to go through all the possibilities doesn’t correlate very well with intelligence.
Today is my turn in our Seminar on A Survey of Elliptic Cohomology.
I attempted to write a survey of some central ideas in Jacob Lurie’s Structured Spaces.
You can find it here: Notions of Space.
You may think of this post also as a continuation of our discussion about Comparative Smootheology I II III.
The various languages and formats that make up modern web pages (HTML, XHTML, CSS, etc.) work wonderfully for most purposes, but there is one place where they are still somewhat clunky, namely in the presentation of mathematical equations and diagrams on web pages. While web formats do support very simple mathematical typesetting (such as the usage of basic symbols such as π, or superscripts such as x2), it is difficult to create more sophisticated (and non-ugly) mathematical displays, such as
without some additional layer of software (in this case, WordPress’s LaTeX renderer). These type of ad hoc fixes work, up to a point, but several difficulties still remain. For instance:
There are a number of extensions to the existing web languages that have been proposed to address some of these difficulties, the most well known of which is probably MathML, which is used for instance in the n-Category Café. So far, though, adoption of the MathML standard (and development of editors and other tools to take advantage of this standard) seems to not be too widespread at present.
I’d like to open a discussion, then, about what kinds of changes to the current web standards could help facilitate the easier use of mathematical displays on web pages. (I’m indirectly in contact with some people involved in these standards, so if some interesting discussions arise here, I can try to pass them on.)
Posted in non-technical, question Tagged: html, mathematical formatting, MathML
This weekend we’re having a meeting of the American Mathematical Society here at Riverside. Julie Bergner and I are running a special session on Homotopy Theory and Higher Algebraic Structures, and there will also be two special sessions on knot theory, one run by Alissa Crans and Sam Nelson. It should be fun! And it’s starting already: Khovanov will be giving a colloquium talk today.
But I’m giving a talk in another session — the session on History and Philosophy of Mathematics, run by Shawnee McMurran and James J. Tattersall. Shawnee was a grad student here at UCR back when I first arrived.
My talk is not very profound or professional, but I hope it’s at least fun:
It’s designed to look best in full screen mode, at least on my small laptop.
As usual, comments and corrections are eagerly awaited! I hope to keep delving into these issues as the years go by. I’m already trying to recruit my Scottish friends to investigate the mysterious stone balls at the National Museum of Scotland in Edinburgh and the Glasgow Art Gallery and Museum. And I’m going to find out more about Scholium 1 in Book XIII of Euclid’s Elements.