More on Frisch

February 12, 2012

More puzzlement about something in Frisch’s book. On p. 17:

These is disagreement between those who think that the domains of applicability of all scientific theories are essentially limited and those who have a hierarchical conception of science according to which the ultimate aim of science is to discover some universal theory of everything. My claim that classical electrodynamics is (and to the best of our knowledge will remain) one of the core theories of physics is independent of that debate… For, first, we need to distinguish between the content of a theory and our attitude toward that content. Even if we took the content of classical electrodynamics to be given by universal claims with unrestricted scope about classical electromagnetic worlds, our attitude can be that we accept or endorse those claims only insofar as they concern phenomena within the theory’s domain of validity.

Unlike Frisch, I do not see the point (other than the obvious rhetorical one of avoiding terminological disputes) of distinguishing between the content of a theory and our attitude towards it. Under Frisch’s account, that part of the content of the theory that encompasses claims about what the entire universe is like is irrelevant to our knowledge of the physical world, because we accept only claims of the theory that fall within the theory’s domain of validity. What, then, is the purpose of attributing these useless claims, which have no bearing on our knowledge of the world, to the content of the theory?

It may sound here like I am just arguing about terminology, but I do think this particular use of language, which is deeply entrenched in philosophy of physics, is pernicious. It is pernicious because by elevating some particular construal of the mathematical structure of the theory to be straightforwardly “the content” of the theory, one is led to an overemphasis on the mathematical structure as being informative about the world, while overlooking the informativeness inherent in how that structure fails to perfectly map onto the world. This terminology in effect provides a licence to focus solely on formal features of the mathematical structure as being the “essence” of the theory without seriously considering how the structure hooks on to the actual world, as opposed to how it hooks on to the philosopher’s possible worlds.


Frisch on reliable theories

February 12, 2012

There’s a really confusing passage in p. 42 of Mathias Frisch’s book on inconsistency in classical electrodynamics. He suggests, in response to the “problem” of inconsistency in classical electrodynamics, that we modify our account of theory acceptance:

this problem disappears if in accepting a theory, we are committed to something weaker than the truth of the theory’s empirical consequences. I want to suggest that in accepting a theory, our commitment is only that the theory allows us to construct successful models of the phenomena in its domain, where part of what it is for a model to be successful is that it represents the phenomenon at issue to whatever degree of accuracy is appropriate in the case at issue. That is, in accepting a theory we are committed to the claim that the theory is reliable, but we are not committed to its literal truth or even just of its empirical consequences. This does not mean that we have to be instrumentalists. Our commitment might also extend to the ontology or the ‘mechanisms’ postulated by the theory. Thus, a scientific realist might be committed to the reality of electrons and of the electromagnetic field, yet demand only that electromagnetic models represent the behavior of these ‘unobservables’ reliably, while an empiricist could be content with the fact that the models are reliable as far as the theory’s observable consequences are concerned.

If acceptance involves only a commitment to the reliability of a theory, then accepting an inconsistent theory can be compatible with our standards of rationality, as long as inconsistent consequences of the theory agree approximately and to the appropriate degree of accuracy… our commitment can extend to mutually inconsistent subsets of a theory as long as predictions based on mutually inconsistent subsets agree approximately.*

What confuses me about this is that I do not know what Frisch could mean by a theory being reliable apart from its consistently producing predictions that agree with experiment. Frisch wants to avoid instrumentalism by claiming that in accepting a theory, all we are committed not just to the observable consequences of the theory, but also possibly to the reality of the ontology and mechanisms of the theory. That is, in accepting the theory of electrodynamics, we might also be committed to the claim that electromagnetic models represent the behavior of ‘unobservables’ like ontology and mechanisms reliably. But what does it mean to represent reliably, apart from being a representation that reliably leads to predictions that agree with experiment? What does Frisch mean in the excerpt above by “represents the phenomenon at issue to whatever degree of accuracy is appropriate”? How can degrees of accuracy be attributed to representations over and above the accuracy of their experimental predictions?

Incidentally, I’m appalled at how expensive Frisch’s book is now. I bought it for $9 on Amazon when OUP slashed prices after having decided to stop printing it. Now it costs $60. The Kindle Edition costs $53.72!

* Frisch, M. (2005). Inconsistency, Asymmetry, and Non-Locality: A Philosophical Investigation of ClassicalElectrodynamics. Oxford University Press, USA.


What is a ‘fundamental theory’?

March 20, 2011

‘Fundamental theory’ is a phrase that is often used by philosophers and scientists alike, but I’m not sure what they mean by that. In this paper, Mathias Frisch writes:

In the case of non-fundamental theories, domain restrictions may also include restrictions to certain length- or energy-scales.

This suggests that fundamental theories are supposed to apply to all length- and energy-scales.

In the paper that Frisch is replying to, Gordon Belot writes:

We will of course need further principles that demarcate the domain of applicability of each theory — but every non-fundamental theory involves such principles.

Do fundamental theories, then, have no such restrictions on their domains of applicability? If so, does that mean that fundamental theories are just ‘theories of everything’? Does it also mean that we have no real examples of fundamental theories for our world, since all of our theories apply only to limited kinds or arrangements of matter?

Elena Castellani gives a quite different account of ‘fundamental physics’:

“fundamental physics” is the physics concerned with the search for the ultimate constituents of the universe and the laws governing their behavior and interactions. Fundamentality, on this view, is the prerogative of the physics of smaller and smaller distances (or higher and higher energies), and, accordingly, particle physics and cosmology are currently identified as the fields where the quest for the “final theory” takes place.

I find the quick move from ‘fundamentality’ to ‘finality’ in her last sentence quite intriguing. But that aside, note that Castellani here takes fundamental physics to refer to the study of the “ultimate constituents” of the universe and their behaviour. This characterisation is quite different from Frisch’s — Castellani’s ‘fundamental physics’ may involve a restriction to certain length scales (the very small). But perhaps she has a notion of ‘fundamental theory’ that is distinct from that of ‘fundamental physics’.

In fact, although they never explicitly state this, the discussion between Frisch and Belot regarding the consistency of classical electrodynamics often refers to a certain subset of the equations of classical electrodynamics as characterising the ‘fundamental theory’, with other subsets being non-fundamental theories of classical electrodynamics. But we know that classical electrodynamics has a limited domain of applicability. So I’m not sure what they mean by ‘fundamental theory’ in this context.

There is also a notion of ‘fundamental’ at work that talks about theories being more or less fundamental than other theories, but may not call any one theory ‘fundamental’ tout court. Stephan Hartmann, for example, characterises one theory as ‘more fundamental’ than another if it covers a broader range of energy scale which includes the energy scales at which the ‘less fundamental’ theory is valid. In the same paper, Hartmann quotes Steven Weinberg as claiming that a more fundamental theory is ‘on a level closer to the source of the arrows of explanation than other areas of physics’. However, this characterisation doesn’t mesh so well with debates in philosophy of science about whether a less fundamental theory can explain phenomena better than a more fundamental theory. If theories are to be defined as more fundamental according to the ‘arrows of explanation’, then it is simply by definition that there are no phenomena that can be explained better by less fundamental theories.

Next, we have Carlo Rovelli, who suggests that a limited domain of validity renders a theory non-fundamental, yet claims that QFT is “the fundamental theory of motion” (p. 257 of the linked book):

General relativity cannot be seen as a ‘fundamental’ theory since it neglects the quantum behavior of the gravitational field, but many of the directions that are explored with the aim of finding a quantum theory of the gravitational field and/or extending the Standard Model — perhaps to a theory of everything — are grounded in QFT. Thus, one may regard QFT as the fundamental theory of motion at our present stage of knowledge — playing the role that Newtonian mechanics and its Lagrangian and Hamiltonian extensions played in the eighteenth and nineteenth centuries.

Rovelli seems to be implying here that a ‘fundamental’ theory is one that proves to be fruitful in a large number of ways — at least that’s what I take his comparison with Newtonian mechanics, and his quip about the current role of quantum field theory, to imply. Yet he disqualifies GR as a fundamental theory on account of its neglect of quantum behavior.

So, what is a fundamental theory?


Metaphysics and Effective Theories

February 11, 2011

I’ve lately been finding it really difficult to get myself interested in alleged metaphysical issues stemming from assuming that a certain physical theory applies to the entire universe. Many of the ‘philosophical’ problems of quantum mechanics, for example, are of this ilk, as are many in algebraic quantum field theory and general relativity.

The standard practice in philosophy of physics is to use a so-called fundamental theory for your metaphysical inferences. Effective theories are for the most part ignored. The intuition is that the ‘fundamental theory’ describes ‘what is really out there’ while effective theories are somehow more phenomenological, or derivative of the fundamental theory (and hence need not be considered in addition to the fundamental theory).

A question arose in class today as to why is it that people who worry about entanglement in quantum mechanics typically worry about it in relation to special relativity rather than general relativity. The standard answer is that gravity is an effect that is negligible in the entanglement experiments we are considering, so we do not have to worry about what a quantum theory of gravity would have to say about the issue.

That got me wondering about how far someone could use that answer and still maintain that it is useful to figure out what the metaphysics of our world is by supposing that quantum mechanics applies to the entire universe. My worry is this. By using the ‘gravity is negligible’ reason, one is admitting that quantum mechanics is really just another effective theory — it has a limited domain of application. If so, then either

  1. One thinks that in general it is legitimate to derive metaphysical conclusions using effective theories, or
  2. One thinks that there is something special about quantum mechanics as an effective theory, which allows one to derive metaphysical conclusions from it, as opposed to other effective theories that are typically ignored (e.g. effective field theories).

If one goes with 1., then it seems to me that the right way to do scientifically-informed metaphysics is to take the various theories we have as each being informative about their respective domains of applicability. This has the implication that we should not be applying quantum mechanics to the entire universe and taking the metaphysical implications of that seriously. For it is classical theories that are most effective at large size scales, not quantum mechanics.

As for 2., I am still struggling to imagine what could be special about quantum mechanics that licenses us to treat it in a different way from other effective theories. One possible reason is that one thinks that the most important aspects of quantum mechanics will still persist in a ‘final theory’ which applies to the entire universe. But whatever these preserved aspects are, it’s not clear to me that they are the same aspects as those that lead to the traditional philosophical problems in quantum mechanics. It might be that the mathematics of the final theory is such that the problems with locality and whatnot that manifest themselves in quantum mechanics are somehow dodged. One can apply the same consideration to other issues in philosophy of physics. Maybe underdetermination in general relativity won’t actually translate to an underdetermination problem in the final theory.

In addition, if you look at the history of physics, it doesn’t seem to me that the aspects of older theories that are preserved in newer theories are those which tend to preserve philosophical problems in the older theories. It doesn’t seem to me as though any of the aspects of classical physics that are preserved in quantum mechanics are those that are philosophically problematic for either theory. In other words, the robust aspects of physical theories often aren’t those that lead to traditional philosophical problems.

Someone help me out here. I seem to be missing out on a lot of fun by being so pessimistic about this enterprise of reading metaphysics off ‘fundamental’ theories.


Epistemic opacity in simulations

January 10, 2011

This post is the result of reading Wittgenstein and the philosophy of simulation literature in close temporal proximity.

Here is Paul Humphreys on epistemic opacity in computer simulations:

a process is epistemically opaque relative to a cognitive agent X at time t just in case X does not know at t all of the epistemically relevant elements of the process. A process is essentially epistemically opaque to X if and only if it is impossible, given the nature of X, for X to know all of the epistemically relevant elements of the process. For a mathematical proof, one agent may consider a particular step in the proof to be an epistemically relevant part of the justification of the theorem, whereas to another, the step is sufficiently trivial to be eliminable. In the case of scientific instruments, it is a long-standing issue in the philosophy of science whether the user needs to know details of the processes between input and output in order to know that what the instruments display accurately represents a real entity.

The charge is that simulations bring something new to philosophy of science because they are epistemically opaque, unlike, say, the process of solving an equation analytically.

However, I’m not sure I understand how simulations are any more epistemically opaque than physical experiments or non-automated calculations in mathematics. First, consider experiments. It seems to me that the checks we make to ensure that the results of experiments are reliable are almost completely analogous to those we make to ensure that the results of simulations are reliable. Allan Franklin has a good list of the kinds of checks we make to ensure that experiments produce reliable results. All the seven criteria he describes there seem to be used to validate simulations as well as physical experiments. We do check that the simulation reproduces known results and artifacts. We do try to eliminate plausible sources of error. If the simulation produces a striking pattern that can’t be explained by plausible sources of error, we do use that pattern itself to argue for the validity of that pattern as a legitimate result. If multiple independently corroborated theories account for the results of a simulation, that does add to the validity of the results. Simulations are often based on well-corroborated theories. Finally, statistical arguments are used to argue that patterns seen in simulations are real.

So what is epistemically relevant in simulations that humans cannot know, that can be known in the case of physical experiments and mental or pen-and-paper mathematical calculations? I’m guessing that what Humphreys takes to be epistemically relevant in simulations but inaccessible to human knowledge is something like the results of each computational step in the simulation, or whether the mechanistic workings of the simulating apparatus produces mathematically correct results. But are the results of each computational step epistemically relevant? Here is one reason to think not. In a physical experiment, one never has a complete working theory of the apparatus that tells us the exact consequences of every step in the experiment. It seems to me that demanding that the result of every computational step in the simulation be epistemically accessible to humans is analogous to demanding that every step in the experiment be justified by a theory that describes every aspect of the apparatus.

What if Humphreys considers the reliability of the simulating apparatus, that is, whether it is producing mathematically correct results, as the epistemically relevant aspect of simulations that is essentially inaccessible to humans? As noted above, the same way one can validate the reliability of experiments without having a complete theory of the experimental setup, we have ways of validating the reliability of simulations. But they are not foolproof of course. Suppose we take seriously the possibility that our methods of validation still leave out epistemically relevant information. It is possible that even though our checks show that the results are reliable in a large variety of situations, some hocus-pocus is going on which can be discovered only by going through every single step in the simulation, which humans cannot do. But there is an analogous “problem” when it comes to mental or pen-and-paper arithmetic. One’s belief that one is calculating 2098×98723 correctly, if one is doing it for the first time, is based on one’s past success in calculating various other things correctly. Of course some hocus-pocus could be going on just this time, for the new calculation, a kind of hocus-pocus which did not show itself in previous calculations. But this possibility does not lead us to say that there is something epistemically missing from the new calculation. If one really wants to be paranoid, one could always doubt the results of mental or pen-and-paper calculations, because after all we do not know, mechanistically, how the human mind consistently applies arithmetical rules, and whether it always correctly applies them. We act as though it always consistently applies them because of prior evidence of its reliability, but these do not suffice to ensure with certainty that it will always consistently apply them. How is this different from the case of simulations? In simulations, we also only have the prior results of simulations, and the backing of mathematics and physical theories relating to the mechanics of the simulation, to assure us that this time the simulation will also be reliable.

Humphreys’ ascription of epistemic opacity to machine calculations but not human calculations is an interesting inversion of one point of view that Wittgenstein discusses at various points in his philosophy of mathematics. Wittgenstein identifies the philosopher of mathematics’ love for axiomatic reductions of mathematics due to the idea of “mechanical insurance against contradiction” (RFM, p. 107e, his emphasis). The idea is that by reducing mathematics to a set of rules that even a machine can follow, one excludes mistakes from mathematics:

We may trust ‘mechanical’ means of calculating or counting more than our memories. Why? — Need it be like this? I may have miscounted, but the machine, once constructed by us in such-and-such a way, cannot have miscounted. Must I adopt this point of view? — “Well, experience has taught us that calculating my machine is more trustworthy than by memory. It has taught us that our life goes smoother when we calculate with machines.” But must smoothness necessarily be our ideal (must it be our ideal to have everything wrapped in cellophane? (RFM, 106e)

ResearchBlogging.org
Humphreys, P. (2008). The philosophical novelty of computer simulation methods Synthese, 169 (3), 615-626 DOI: 10.1007/s11229-008-9435-2
Wittgenstein, L. (1967). Remarks on the Foundations of Mathematics, ed. G. H. von Wright, R. Rhees, and G. E. M. Anscombe, trans. G. E. M. Anscombe. MIT Press.


Formalist Analogies in Statistical Mechanics

July 31, 2010

I recently read Mark Steiner’s neat little book on the applicability of mathematics in physics. His main thesis is that the ways in which mathematics is successfully applied in physics are often anthropocentric. He takes this as a strike against naturalism.

One example of anthropocentric reasoning he identifies is the use of what he calls formalist analogies in the discovery/construction of new theories. One example that bugs many philosophers is “quantization”, where the quantum mechanical description of a system is derived by considering its classical description and replacing classical observables with quantum mechanical operators. This technique was introduced by Heisenberg, whose heuristics I expressed amazement at in my last post. There are problems with this heuristic, such as how to deal with descriptions containing products of classical variables, given that quantum mechanical operators don’t necessarily commute, but there are some standard workarounds that seem to work for most cases. Those problems are not really the issue Steiner is getting at, though. His issue is that the matrices Heisenberg uses to replace classical observables “have no independent physical meaning”; they are mere formalisms. The matrix equation one gets from a quantization of a classical equation is parasitic on the classical equation, which is itself “false” according to quantum mechanics. This lack of independent physical meaning, Steiner argues, means that we are not entitled to use induction to infer that since quantization works in certain model cases, it will work for all cases.

I’ve always been disturbed by the approach to statistical mechanics that makes use of Gibbsian ensembles, particularly the grand canonical ensemble. Part of my discomfort with it may be because many textbooks introduce them using formalist analogies, in Steiner’s sense. Gibbs himself was thoroughly instrumentalist about the ensembles and did not ascribe any physical meaning to them, but modern textbooks are liable to be more cavalier about physical meaning. Thus, one often finds them treating the ensembles as more than just a calculational technique, which I think treads into formalist analogy territory.

Gibbs emphasizes throughout his classic monograph that his ensembles are purely imaginary and meant to make calculations easier. However, later textbooks have a tendency to try to justify the use of ensembles by a mixture of physical and formal analogies. For example, textbook authors often speak of the equilibrium ensemble derived from combining two grand canonical ensembles (Tolman is one example). They take the resultant ensemble as the equilibrium that would result from combining a representative from one ensemble with that from another. Taken merely as a calculational tool, this is unproblematic. The formalistic reasoning comes into play when the outcome of the interaction of ensembles is straightforwardly taken, without further justification, to represent the outcome of the interactions of actual systems. For, as in the quantization analogy, the ensemble is parasitic on the actual system for its physical relevance. When two systems interact, we do not have two ensembles interacting. So there is no physical case to be made that the outcome of the ensembles’ interaction also represents that of the systems’ interaction. Just as the success of quantization in a few cases doesn’t seem to give us reason to expect a successful induction to all cases, the success of representing a system on its own with an ensemble doesn’t seem to give us reason to expect a successful induction to cases where multiple systems interact.

Certainly there exist authors who are more careful about using ensembles. Fowler for example justifies their use not by claiming that the entire ensemble represents the system of interest, but rather that the system is itself a small part of some larger system that has the characteristics of an ensemble. This makes more physical sense, but it means checking for more physically realistic conditions that your system of interest must fulfill, before applying ensemble methods. In Fowler’s case, he requires that the system of interest is one of many subsystems of a large ensemble-like system, where the subsystems exchange only small amounts of energy — small compared to the total energy of the ensemble. At the same time, however, their interactions with one another must be significant enough to allow the entire ensemble to attain an equilibrium state.

Newer textbooks, however, have a tendency to simply introduce Gibbsian ensembles, without checking for physical sense and the restrictions that must accompany them, and “justifying” them with mere formal analogies. One wonders what the point of such “justifications” is — I prefer Gibbs’ honest admission that he introduces ensembles only because they give him the correct answers.

Even more annoying are cases where a “justification” for using an ensemble is introduced with reference to a realistic physical model, but the ensemble is then used for examples where the physical conditions in that model, the conditions that were relevant to the justification, do not hold! For example, Pathria introduces the grand canonical ensemble by considering a system exchanging particles and heat with a large reservoir. However, all the problems he next considers, to which the grand canonical ensemble is applied, are cases where particle number is conserved! The only exception is an example of adsorption of particles on a surface, which appears as an exercise at the end of the chapter. We know that the grand canonical ensemble gives us the right answers even for systems that have a constant number of particles because for many-particle systems, the equilibrium ensemble contains is composed mostly of systems with the “equilibrium” number of particles, so when you average over the ensemble to get the equilibrium number, systems with a non-equilibrium number of particles contribute nearly nothing to the average. But this merely justifies the grand canonical ensemble as a calculational trick and is wholly separate from the physical model that was used to justify the ensemble method, a model whose salient features were then thoroughly ignored when the ensemble method was applied to other systems.


Landau on the second law

June 7, 2010

I was browsing through a collection of Landau’s papers and came across one of the more bizarre explanations of the second law of thermodynamics that I’ve seen. The few people who frequent this blog probably don’t need this explained to them, but the reason why many philosophers think that the second law needs ‘explaining’ is that the fundamental laws of classical and quantum mechanics are time symmetric, but the second law is not. If we think that all physical phenomena are completely described by the fundamental laws, then it seems puzzling that one can observe large scale time asymmetry when said laws are time symmetric.

Another way of stating that puzzle is that if one is given an isolated system in a state of less-than-maximal entropy, then standard applications of statistical mechanics lead to the conclusion that the system is likely to be found in a state of higher entropy in both directions in time — the past and the future. Landau accepts this as well. He then takes the problem of explaining observed monotonic change in time (a popular interpretation of the second law) to be one of explaining why all isolated subsystems seem to undergo monotonic change in the same direction. He offers the following argument:

If a statistical body is isolated from its surroundings for a period which is not too large compared with the relaxation time,* its entropy changes monotonically. If two such bodies are isolated, then the monotonic changes cannot occur in mutually opposed directions. This is immediately clear, since these two systems can be connected by an arbitrarily weak coupling into one single system, at any rate when their periods of isolation are simultaneous or partly so. If we apply the proof introduced above of the monotonic entropy change in this case of two weakly linked parts of one system, then we can see immediately that the directions of the entropy changes of the parts cannot be opposed to each other. If different systems are not simultaneously isolated from their surroundings, then we can introduce to the argument any necessary intermediary steps. We can now see that either the entropy of every statistical body which is not isolated from the universe for too long is constant, or there is a particular universal time direction for entropy increase.

This is weird since the clause about ‘relaxation time’ (see footnote) is a provision for the [supposed] fact that on time scales larger than the relaxation time, there isn’t a monotonic increase in entropy. So the ‘universal time direction’ would in this argument hold only for the system’s relaxation time. Yet, it’s also supposed to hold for all isolated systems that could be potentially coupled to the system in consideration! So, all such potentially coupled systems will undergo monotonic increases in entropy during the same time scale as our model system’s relaxation time!

It gets more bizarre. Recall that in the passage above, Landau concludes that either

  1. ‘the entropy of every statistical body which is not isolated from the universe for too long is constant’, or
  2. ‘there is a particular universal time direction for entropy increase.’

Landau opts for 2. Here’s how:

Everyday experience shows, however, that [1.] is not the case. This alone is quite sufficient (even without Bohr’s hypothesis that the inner layers of the stars may only be described in terms of relativistic quantum theory) to conclude that there are at least regions in the world which do not obey the statistics.** This makes the two directions of the time axis non-equivalent; the direction which is associated with the increase of the entropy of a system isolated over not too long a period form its surroundings can be defined as the future. For classical mechanics the conceptions of past and future are entirely without meaning. Only then do we obtain the second law of thermodynamics as the law of increase of entropy. This laws, as well as the existence of past and future observed in everyday experience, is only possible because the world as a whole does not obey the laws of thermodynamics.

This is the first time I’ve seen a claim like this: that entropy increases monotonically with time only because the laws of thermodynamics don’t apply to the entire universe! I must admit that because of the problems mentioned above, I can’t take any of Landau’s suggestions seriously, but the paper was a highly amusing read, and very reflective of how physicists tend to be much more imaginative than philosophers of physics.

*This clause about ‘relaxation time’ is supposed to get around the notion that the evolution of a system’s entropy with time should appear time-symmetric on the large scale, as a series of mostly small fluctuations away from equilibrium and back again. If we look at the system on a time scale close to the relaxation time, then we should be ‘within’ that portion of a fluctuation that involves monotonic increase or decrease of entropy with time.

**Landau states earlier in the paper that ‘in relativistic conditions… thermodynamic considerations [are] no longer valid’, and that he does not ‘expect thermodynamics to be upheld in the case of relativistic quantum theory’.

ResearchBlogging.orgM. Bronstein, & L. Landau (1933). Über den zweiten Wärmesatz und die Zusammenhangsverhältnisse der Welt im Großen Physikalische zeitschrift der Sowjetunion, 4


Sean Carroll’s From Eternity to Here

March 6, 2010

I recently finished reading Sean Carroll’s new book, From Eternity to Here. Peter Woit opined that the book was “an extended essay in the philosophy of science” and was curious to know what philosophers would think of it, so I thought I’d give it a shot. While the questions that Carroll addresses are questions that are discussed by philosophers of physics, the book lacks good arguments for the conclusions that Carroll draws. He makes many sweeping claims, many of which have been disputed by philosophers, but he makes no engagement with those disputes. The arguments he provides are not new in the literature and are much better defended elsewhere. So it’s certainly not a new contribution to the philosophy of physics literature, and if it is philosophy of science, is philosophy of science badly done.

First, a short sketch of Carroll’s argument. Carroll essentially takes David Albert’s line that the tension between the time-asymmetric second law of thermodynamics and the time-symmetric laws of classical mechanics can be resolved by postulating low-entropy initial conditions for the universe — a postulate that has come to be called the Past Hypothesis in the philosophical literature. On top of that, Carroll argues that those initial conditions themselves need explanation, and the explanation he favours is a multiverse that buds off little “baby universes” that start off with low entropy initial conditions, We, Carroll suggests, are in one of those baby universes.

Carroll often makes a major philosophical point without sufficient justification and without addressing some obvious objections. Most but not all of the following instances where he does this are points are important for his main argument.

  1. Explanation of “special” initial conditions of the universe. Craig Callender objected in his review of Carroll’s book to Carroll’s assumption that “special” initial conditions have to be explained at all. Callender himself argued this with more detail in a paper, and the issue of whether explanation is needed in this case is a matter of debate in philosophy of science. But Carroll doesn’t justify his assumption at all.
  2. The Past Hypothesis. As mentioned, Carroll’s argument for the Past Hypothesis is basically David Albert’s. This argument has been criticized in various places. To his credit, in this case Carroll does mention the existence of those criticisms, in footnote #142, where he admits that the status of the Past Hypothesis is “not uncontroversial”. However, he does not engage with those criticisms.
  3. The Past Hypothesis as a law of nature. Carroll states that there is a distinction in physics between laws and initial conditions, and that since the Past Hypothesis is an instance of the latter, it is not a law of nature. But this does not address the reason supporters of the Past Hypothesis think the Past Hypothesis is a law of nature. The reason David Albert, Barry Loewer and a few others think that the Past Hypothesis is a law of nature is because they hold a Humean, Mill-Ramsey-Lewis view of what the laws of nature are, and think that the Past Hypothesis would appear in the system of statements about the world that forms the best balance of simplicity and informativeness.* Carroll’s statement about laws versus initial conditions completely ignores the [prima facie, not unreasonable] view of laws of nature held by these people. Carroll’s objection to calling the Past Hypothesis a law of nature is premised on a different view of what the laws of nature are, a view that is neither articulated nor justified.**
  4. The relation between information theoretic entropy and thermodynamic entropy. Carroll repeats Landauer and Bennett’s arguments for equating information theoretic entropy and thermodynamic entropy. These arguments have been criticised, and Carroll does not mention these criticisms. The equation of those two kinds of entropy is put forward as unproblematic when it is not.
  5. The different arrows of time. Carroll frequently conflates the different arrows of time and makes big, unsubstantiated claims based on those conflations. A series of such claims is presented with hardly any support on pp. 40-41. Among the claims are:
    • The arrow of time*** “explains” why we remember the past but not the future.
    • The fact that our memories of the past are reliable is explained by the monotonic increase of entropy with time.
    • “We distinguish past from future through the relationship between cause and effect.” (Isn’t it the other way around?)
    • From the previous point, he thinks it follows that “part of the distinction we draw between ‘effects’ and ’causes’ is that ‘effects’ generally involve an increase in entropy”. (I don’t understand. How do causes not generally involve an increase in entropy? How are causes somehow exempt from the second law of thermodynamics? And what does he mean by “involve”?)
    • “Our notion of free will… is only possible because the past has a low entropy and the future has a high entropy.”

Overall, I think it’s a nice pop science book to read for laypeople who want to get their feet wet in some of the issues in the foundations/philosophy of statistical mechanics, but for those already familiar with the ins-and-outs of that field, and for philosophers in general, Carroll’s book is too sloppy to be of much help.

*See for example this paper by Loewer.

**It seems, among other things, to be a non-Humean view.

***It’s not clear at this point even what Carroll means by the arrow of time.


Reductivism, simulations and materiality

November 11, 2009

One of Jerry Fodor’s arguments against reductivism in his 1974 paper is that bridge laws reducing, say, economics to physics are undermined by the possibility of simulations of economic systems that instantiate the same economic ‘laws’ but have a very different physical basis from economic systems that are composed of interacting humans. It’s difficult to imagine what bridge laws would carry out the reduction successfully for both the simulated economy and the real human economy.

Someone in class pointed out that this would seem to be more of a problem for economics than for something like chemistry. After all, it seems like the physical basis for chemical systems has to be electrons, protons, and so on — the ontology of quantum mechanics, the science to which it allegedly reduces. What other physical basis could there be?

But then there are such things as simulations of chemical systems. If we take these to be parallel to the case with simulated economic systems, then bridge laws relating chemistry to quantum mechanics would have to cover not only the cases where the physical properties are instantiated in an actual physical system, but also those where they aren’t actually instantiated but represented in a computer simulation.

I take it that it would be an unacceptable move to say that the laws of chemistry don’t apply to simulated chemical systems, and in any case it’s difficult to see why one should be allowed to say that and not allowed to say that the laws of economics don’t apply to economic systems that aren’t composed of humans.

The only other way at present I can think of to get out of this bind is to say we should somehow regard the chemical simulation as instantiating the physical properties it represents. But the awkwardness in that phrase (what does it matter how we regard it — it either instantiates them or it doesn’t, regardless of our regard) suggests that I can’t find a way to make this sound like a good move. And in any case we run into the same problem: why should we do this for the chemistry case but not for the economics case?

Still, I have this intuition that somehow chemistry is essentially about actual atoms and molecules, while economics is more removed from the material nature of the systems it is applied to. But I can’t think of a way to justify it.


Taking baby physics too seriously

October 28, 2009

I don’t know if this objection to the Dowe/Salmon causal processes account of causation has been raised in the literature. It’s on hindsight fairly obvious, so I would be surprised if it hasn’t.

The notion of a causal interaction is crucial to the causal processes view. In Dowe’s account, a causal interaction is an intersection of two world-lines of causal processes that involve an exchange of a conserved quantity (e.g. energy-momentum).

Part of the motivation of requiring the exchange of a conserved quantity is that that is a requirement that seems to rule out pseudo-processes and that seems to be grounded in physics. But Salmon’s and Dowe’s writings on causal processes are littered with baby physics examples like a ball hitting a window and billiard balls colliding. And it wasn’t until yet another class on causal processes this morning that it struck me that in none of those examples of collisions is there an intersection of world-lines of objects. If you want to talk about particles, none of the world-lines of the constituents of the balls intersect: the ‘collision’ is simply a matter of the electrostatic forces between those constituents preventing them from getting too close! This seems obvious, but somehow the use of such homely examples hoodwinked me until I’d had to sit through a summary of the causal processes framework for the fourth or fifth time in the last 8 weeks.

Relaxing the requirement to “approximate” intersections (i.e. close approaches) doesn’t work either. It’d merely open a larger can of worms about what the threshold for when a near approach should be considered an “intersection”. You would need to consider scattering cross-sections and so on, which obviously differ depending on the forces and particles involved.

Seems to me that if you really want to talk about intersections of world-lines, then you need to use the field-theoretic conception of physics. But that brings along a whole other suite of problems like how to individuate objects and processes on that conception, and how, if you call a set of points on a manifold a ‘process’, that may be said to ‘possess’ a ‘conserved quantity’.


Follow

Get every new post delivered to your Inbox.