Friday, February 29, 2008

early mornings

sleep-stuff crusted eyes
crack'd reluctantly by sun:
window faces east

Thursday, February 28, 2008

self control

actions determin'd
deliberately, and yet . . .
in hindsight: doubt

Wednesday, February 27, 2008

ignorant peers

"too good to listen"
like picnic flies, swarming 'round
spouting nonsense: why?

Monday, February 25, 2008

tea bags

Tea, like any organic product, varies wildly in quality across several parameters. One parameter, of course, is the species / breed of the tea plant; a second is the tending / harvesting of the tea; a third is the preparation of the harvested leaves for sale; and a fourth is the preparation of the purchased tea product for consumption.

[Analogously, we might speak of the breed of cow (Angus, Kobe, . . .); the treatment of the cow (its diet, exercise, . . .); the cut of beef (ribeye, filet mignon, . . .); the treatment of the meat (smoked, seasoned, marinated, . . .); and the manner in which the steak is prepared (grilled, BBQ'd, broiled, . . .).]

The tea bag dramatically reduces complications involved in the preparation of tea products for consumption. The would-be tea drinker no longer has to either (i) measure out the appropriate amount of tea, or (ii) filter the tea leaves from the water once steeping has finished; both tasks are handily dealt with by the convenient bag.

Nevertheless, the bag brings with it a cost. This cost manifests itself as a restriction on the values that key parameters relevant to overall tea quality can take.

First, the bag restricts the space in which tea leaves can expand. Yet, expansion of tea leaves is necessary for proper steeping. Furthermore, the size of tea leaves is positively correlated with tea quality. Consider, for example, a fine Taiwanese Oolong which comes dry in pellets such as this one:

[photo: a dry oolong pellet]

When properly steeped, however, the leaf expands to reveal its size and shape. Here is the exact same tea leaf after proper steeping, approximately 10x wider and 25x longer than before:

[photo: the same leaf after proper steeping]

So, high quality leaves cannot steep properly in small bags. Furthermore, the convenience of the tea bag breeds a carelessness in preparation, so consumers often oversteep, or steep a tea at an inappropriate temperature (subtle green teas, for example, lose all their flavor if steeped in boiling water and become bitter if steeped for too long).

De facto, however, it is not just extremely large leaves which are excluded from tea bags. The usual practice is to fill them with tea dust, very tiny crumbs of tea, the floor sweepings from the processing of quality tea, which excludes almost all tea but that of the lowest quality. The economic feasibility of tea bags depends upon their use of such extremely poor-quality (and thus cheap) tea.

The consequence: for the sake of convenience, the lion's share of the tea drinking population drinks substandard tea. First corollary: loose leaf tea is almost impossible to find in any standard grocery store or coffee shop. Second corollary: to fuel the yuppie drive for more expensive goods, "gourmet" tea bags have begun to appear. Yet such "gourmet" teas are of far lower quality than, and now even more expensive (per cup) than, most loose leaf teas.

Thursday, February 21, 2008

no fish in the tank

Insignificant people must be tolerated in order to keep them well disposed. Then we can make use of them if we should need them. If we become alienated from them and do not meet them halfway, they turn their backs on us and are not at our disposal when we need them. But this is our own fault.

i ching, kou

On psychological grounds I have discarded the idea that we are dealing with mere chance numbers. In a total picture of natural events, it is just as important to consider the exceptions to the rule as the averages. This is the fallacy of the statistical picture: it is one-sided, inasmuch as it represents only the average aspect of reality and excludes the total picture . . . Inasmuch as chance maxima and minima occur, they are facts whose nature I set out to explore.

c g jung, synchronicity

Making an extreme idealization, . . . a person has only one decision to make in his whole life. He must, namely, decide how to live, and this he might in principle do once and for all. Though many . . . have found the concept of overall decision stimulating, it is certainly unrealistic and in many contexts unwieldy.

l savage, the foundations of statistics

Saturday, February 16, 2008

probability and public policy VII: cancer

[part one of this series here]

The variety of interpretations of probability is supplemented by a variety of applications. Unfortunately, the most certain and convenient applications of statistical analysis (those implementable in scientific experiments) don't often provide an answer to our questions about how to act in an uncertain world.

Causal connections, for example, are notoriously difficult to justify: how can we separate the case where smoking causes cancer from that where smoking and cancer share a mutual cause (say, a gene predisposing one to both the practice of smoking and the tendency to contract cancer)? The problem is a vexed one, but the framing misconstrues the situation. Such questions about causality can always be reduced to the more pertinent: how should I act?
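
To make that reduction concrete, here is a minimal decision-theoretic sketch in Python. Nothing in it comes from an actual study; the probabilities, credences, and utilities are all invented for illustration. It compares two available acts by expected utility, weighting the rival causal hypotheses by one's credence in them, so that the question of how to act can be answered without first issuing a categorical verdict on the causal question.

```python
# Hypothetical numbers only: a sketch of choosing an act under causal uncertainty.

# Hypothesis A: smoking causes cancer, so quitting lowers the risk.
# Hypothesis B: a common gene causes both, so quitting leaves the risk unchanged.
p_cancer = {
    "A": {"smoke": 0.20, "quit": 0.05},
    "B": {"smoke": 0.20, "quit": 0.20},
}
credence = {"A": 0.7, "B": 0.3}       # how plausible you find each hypothesis
utility = {"cancer": -100.0, "no_cancer": 0.0}
cost_of_quitting = -1.0               # the inconvenience of giving up the habit

def expected_utility(act: str) -> float:
    """Average the utility of an act over hypotheses, weighted by credence."""
    eu = 0.0
    for hypothesis, weight in credence.items():
        p = p_cancer[hypothesis][act]
        eu += weight * (p * utility["cancer"] + (1 - p) * utility["no_cancer"])
    return eu + (cost_of_quitting if act == "quit" else 0.0)

for act in ("smoke", "quit"):
    print(f"{act}: expected utility = {expected_utility(act):.2f}")
# The decision falls out of comparing these two numbers; no categorical
# answer to "does smoking cause cancer?" is ever required.
```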

This latter question, the question of what to do, is precisely one the answer to which can never be "read off" statistical analyses. Although there are some cases where the answer may be clear (if analysis determines the probability that you will die if you leap off a 15-storey building to be 98%, you probably shouldn't jump off that building), in most cases, the answer provided by a statistical analysis will be orthogonal to the question of what to do. Some examples:

Threshold Effects

Consider, for example, the question: Does the sugar substitute saccharin cause cancer? In fact, this question is largely irrelevant to the more pertinent: Should I ingest saccharin?

Suppose a study demonstrates that feeding lab mice some substantial amount of saccharin each day dramatically increases the probability that they will contract cancer. This seems to demonstrate, fairly conclusively, that, in some sense, saccharin "causes" cancer.

However, many of the natural substances ingested regularly by human beings would cause serious medical problems if consumed in suitably large amounts. The problem here is that there is a threshold for such substances: less than some amount, and they are easily processed by the body; once that amount is surpassed, however, there is a sudden increase in the probability that the body will suffer ill effects.

So, the result that saccharin "causes" cancer in some sense is irrelevant to the determination of "safe" behavior; really, we must know what the pertinent threshold for saccharin is: how much must be ingested before the chance of getting cancer spikes? How close is this threshold to the amount of saccharin I might consume on a daily basis? In the words of E. T. Jaynes, Probability Theory: The Logic of Science (2003):

For virtually every organic substance (such as saccharin or cyclamates), the existence of a finite metabolic rate means that there must exist a finite threshold dose rate, below which the substance is decomposed, eliminated, or chemically altered so rapidly that it causes no ill effects. If this were not true, the human race could never have survived to the present time, in view of all the things we have been eating.

Indeed, every mouthful of food you and I have ever taken contained many billions of kinds of complex molecules whose structure and physiological effects have never been determined - and many millions of which would be toxic or fatal in large doses. We cannot doubt that we are daily ingesting thousands of substances that are far more dangerous than saccharin - but in amounts that are safe, because they are far below the various thresholds of toxicity. At present there are hardly any substances, except some common drugs, for which we actually know the threshold.

Therefore, the goal of inference in this field should be to estimate not only the slope of the response curve but, far more importantly, to decide whether there is evidence for a threshold; and, if there is, to estimate its magnitude (the 'maximum safe dose'). For example, to tell us that a sugar substitute can produce a barely detectable incidence of cancer in doses 1000 times greater than would ever be encountered in practice, is hardly an argument against using the substitute; indeed, the fact that it is necessary to go to kilodoses in order to detect any ill effects at all, is rather conclusive evidence, not of the danger, but of the safety, of a tested substance. A similar overdose of sugar would be far more dangerous, leading not to barely detectable harmful effects, but to sure, immediate death by diabetic coma; yet nobody has proposed to ban the use of sugar in food.
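
As a toy illustration of the estimation problem Jaynes describes (this is not his method, and the dose/response data below are fabricated), the following Python sketch fits a simple hockey-stick model, zero excess response below a threshold and a linear rise above it, and locates the threshold by grid search over candidate values:

```python
# Toy threshold estimation on fabricated dose/response data.
import numpy as np

rng = np.random.default_rng(0)
dose = np.linspace(0.0, 10.0, 60)
true_threshold, true_slope = 6.0, 2.0
response = true_slope * np.clip(dose - true_threshold, 0.0, None)
response += rng.normal(scale=0.5, size=dose.size)      # measurement noise

def sse_at_threshold(t: float) -> float:
    """Residual sum of squares for the best-fitting slope given threshold t."""
    x = np.clip(dose - t, 0.0, None)
    slope = (x @ response) / (x @ x) if x.any() else 0.0
    return float(np.sum((response - slope * x) ** 2))

candidates = np.linspace(0.0, 9.0, 181)
best_t = min(candidates, key=sse_at_threshold)
print(f"estimated threshold (the 'maximum safe dose'): {best_t:.2f}")
```

The quantity of interest here is the location of the bend, not the slope of the line above it.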


Modeling Continuity where Discreteness is Needed

A problem related to that of thresholds is how to take a continuous model and parse it into the discrete chunks relevant for discussion in natural language and, by extension, for policy decisions.

Consider, for example, the question of whether or not second hand smoking laws are scientifically justified. Research that shows second hand smoke is "just as dangerous" as "first hand" smoke depends upon data involving the ingestion of equivalent amounts of smoke. Second hand smoke disperses, however, and is not inhaled by bystanders with the same intensity as "first hand" smoke. The pertinent question, then, is not "is second hand smoke dangerous?", but rather, "at what distance, and in what ventilation conditions, is second hand smoke dangerous?"

The dispersal of second hand smoke is a phenomenon suitable for statistical modeling. Such models do not tell us what is safe, however, merely the density of particles due to second hand smoke at various distances from the source in various ventilation conditions. Here, we have a continuous phenomenon, the dispersal of second hand smoke, and a discrete classification, safe or unsafe. The problem is that statistical modeling of the continuous situation does not determine where upon that continuum the boundary between safe and unsafe lies; yet it is this second question which is relevant to policy decisions.
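
A minimal sketch of the shape of the problem (this is not the Stanford study's model; the fall-off function, source strengths, and cutoff below are all assumptions made for illustration): the model produces a continuous density curve, and the line between "safe" and "unsafe" has to be drawn on that curve by a further, non-statistical choice.

```python
# Toy dispersal model: continuous densities, with an arbitrary "safe" cutoff.
import numpy as np

def density(distance_m: np.ndarray, source_strength: float, decay: float) -> np.ndarray:
    """Assumed exponential fall-off of particle density with distance (arbitrary units)."""
    return source_strength * np.exp(-decay * distance_m)

distance = np.linspace(0.25, 10.0, 40)
still_air = density(distance, source_strength=100.0, decay=0.5)
breezy = density(distance, source_strength=100.0, decay=1.5)

cutoff = 5.0   # the "safe" level: a policy choice, not an output of the model
for label, curve in (("still air", still_air), ("breezy", breezy)):
    below = distance[curve < cutoff]
    print(f"{label}: density falls below the cutoff at ~{below[0]:.1f} m")
```

Change the cutoff and the "safe distance" changes with it, even though the statistical model has not changed at all.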

For example, a recent Stanford University study, one of the first peer-reviewed articles on the topic, emphasizes its findings' support for public conceptions about second hand smoke dispersal (the results are summarized here), in particular, as evidence for a ban on smoking in public places. However, personal communication with one of the graduate students working on the project revealed that at prior stages of analysis there was surprise at how little public fears were confirmed and how little justification for such laws could be found. Whence this apparent disagreement? The study shows that, on the one hand, yes, there are quite high levels of particles in the air near an outdoor smoker; on the other hand, the density of particles drops off quite dramatically within a fairly small distance of the smoker and is highly susceptible to ventilation effects. Furthermore, the effect lasts for a relatively short time. Since the "safe" background level of particles is defined in terms of average density over a 24 hour period, it is not at all clear how the localized and relatively short period of increased particle density compares with that contributed by, say, a passing car.
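
A back-of-the-envelope version of that last comparison (all numbers invented) shows how heavily a 24 hour average dilutes a short, localized spike:

```python
# How a brief spike near a smoker dilutes into a 24-hour average (made-up units).
spike_density = 80.0        # density near the smoker during exposure
spike_minutes = 10.0        # duration of the exposure
background = 2.0            # ambient density the rest of the day

minutes_per_day = 24 * 60
average = (spike_density * spike_minutes
           + background * (minutes_per_day - spike_minutes)) / minutes_per_day
print(f"24-hour average: {average:.2f} (background alone: {background:.2f})")
# The spike looks dramatic up close but adds only ~0.54 units to the daily
# average, which is the scale on which the "safe" background level is defined.
```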

The Clustering Illusion

The clustering illusion occurs when humans see patterns in "random," or mathematically unpatterned, data. Everyday examples of this phenomenon include hearing the murmur of voices in the sound of a bubbling brook, the perception of a gambler that he is on a "winning streak," and instances of religious pareidolia, such as the "nun bun."
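
A quick simulation makes the illusion easy to reproduce (the grid of "neighborhoods" and the case counts below are invented, and the point is only qualitative): scatter cases uniformly at random and the most crowded neighborhood will, more often than not, look alarmingly dense.

```python
# Clusters from genuinely unpatterned data: uniform random placement of cases.
import numpy as np

rng = np.random.default_rng(1)
n_cases, n_neighborhoods, n_trials = 100, 25, 10_000
expected = n_cases / n_neighborhoods          # 4 cases per neighborhood on average

max_counts = []
for _ in range(n_trials):
    placement = rng.integers(0, n_neighborhoods, size=n_cases)
    counts = np.bincount(placement, minlength=n_neighborhoods)
    max_counts.append(counts.max())

max_counts = np.array(max_counts)
print(f"expected cases per neighborhood: {expected:.1f}")
print(f"average count in the worst neighborhood: {max_counts.mean():.1f}")
print(f"P(some neighborhood has at least double the expected count): "
      f"{(max_counts >= 2 * expected).mean():.2f}")
```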

Such perceptions of non-existent pattern are relatively benign, however, as the poor choices they inspire tend to affect only a handful of individuals. When the objects of the illusion are potentially relevant to public policy, by contrast, the clustering illusion can waste millions of dollars, inspire fear, and generate spurious "research." An example is the power lines ~ leukemia scare.

Lawsuits, local legislation, and widespread fear were caused throughout the '80s by the idea that power lines might, via their surrounding electromagnetic fields, somehow increase the possibility that a child contracts leukemia (or, perhaps, some other form of cancer). The inspiration for the studies centered on clustering phenomena, in particular, the apparent clustering of leukemia sufferers in a neighborhood "near" power lines. Subsequent research demonstrated conclusively that there was no measurable correlation between proximity to power-line-generated magnetic fields and tendency to contract leukemia, a result consonant with current scientific theories of magnetic fields. Nevertheless, public misconception and superstition persist; the damage of misguided and statistically unfounded claims continues to play out in those corners of the public sphere dominated by paranoia and underinformed superstition.

A nice summary of the issue by Edward Campion can be found here. We quote only the conclusion:

Serious limitations have been pointed out in nearly all the studies of power lines and cancer. These limitations include unblinded assessment of exposure, difficulty in making direct measurements of the constantly varying electromagnetic fields, inconsistencies between the measured levels and the estimates of exposure based on wiring configurations, recall bias with respect to exposure, post hoc definitions of exposure categories, and huge numbers of comparisons with selective emphasis on those that were positive. Both study participation and residential wire-code categories may be confounded by socioeconomic factors. Often the number of cases of [acute lymphoblastic leukemia] in the high-exposure categories has been very small, and controls may not have been truly comparable. Moreover, all these epidemiologic studies have been conducted in pursuit of a cause of cancer for which there is no plausible biologic basis. There is no convincing evidence that exposure to electromagnetic fields causes cancer in animals, and electromagnetic fields have no reproducible biologic effects at all, except at strengths that are far beyond those ever found in people's homes.
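
One of the limitations Campion lists, "huge numbers of comparisons with selective emphasis on those that were positive," is easy to demonstrate in miniature. The sketch below is pure simulation, with no real effect anywhere by construction: it runs many comparisons on noise and counts how many come out "significant".

```python
# Multiple comparisons on pure noise: every "positive" result here is spurious.
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
n_comparisons, n_per_group, alpha = 100, 50, 0.05

false_positives = 0
for _ in range(n_comparisons):
    exposed = rng.normal(size=n_per_group)      # both groups drawn from the
    unexposed = rng.normal(size=n_per_group)    # same distribution: no effect
    _, p_value = stats.ttest_ind(exposed, unexposed)
    false_positives += p_value < alpha

print(f"{false_positives} of {n_comparisons} null comparisons came out 'significant'")
```

Report only those few and the noise starts to look like a finding.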


The point here is not just that fear and irrational paranoia on the part of the public can motivate policy choices (in contradiction to or via abuse of the results of statistical analysis), but further that such paranoia can motivate bad research and lower standards of scientific rigor below those necessary to ensure meaningful results.

next: intelligent design

Thursday, February 14, 2008

parable R

In the Kingdom of Decadence, where indulgence reigns and self-aggrandizement the weathercock of virtue guides, the lepers of sense their jealous rage did leash 'pon unsuspecting proletarian dreams, draining and desaturating symbolism from symbol, meaning from meant.
Babel forthwith her tremulous head did raise, aquiver in apprehension, expectation primed by cryptographic sensuality; a bevy of humpty-dumptyisms, anomalous intensions, swarm the façon de parler of the plebs, scum-drenched street urchin babbling in tongues like holy men, and with the mutant language, visions, themselves mutations of normal thought, a perversion upon the mind's eye.
And inward turned, the gaze of the "I" lit upon the misshapen folds of the language organ, piercing, incising, invigorating its dormant telos, embarrassing its ego, with efficient and sensitive touch, releasing, relinquishing, remanding, and constructing anew syllable, that subtle tempo, the pacemaker of thought: word.

Tuesday, February 12, 2008

understanding the mind IX

Coming as I do between psyche anatomized and psyche synthesized, I must define my terms that I can bridge the traditional gulf between mind and body and the technical gap between things begotten and things made.

By the term "mind," I mean ideas and purposes. By the term "body," I mean stuff and process. Stuff and process are familiar to every physicist as mass and energy in space and time, but ideas and purposes he keeps only in the realm of discourse and will not postulate them of the phenomena he observes. In this I agree with him, but what he observes is some sort of order or invariance in the flux of events. Every object he detects in the world is some sort of regularity. . .

To detect regularities in the relations of objects and so construct theoretical physics requires the disciplines of logic and mathematics. In these fundamentally tautological endeavors we invent surprising regularities, complicated transformations which conserve whatever truth may lie in the propositions they transform. . . . It is these regularities, or invariants, which I call ideas, whether they are theorems of great abstraction or qualities simply sensed.

But let us now compel our physicist to account for himself as a part of the physical world. In all fairness, he must stick to his own rules and show in terms of mass, energy, space, and time how it comes about that he creates theoretical physics. He must then become a neurophysiologist. . . , but in so doing he will be compelled to answer whether theoretical physics is something which he can discuss in terms of neurophysiology. . . To answer "no" is to remain a physicist undefiled. To answer "yes" is to become a metaphysician—or so I am told.


Warren S. McCulloch, "Why the Mind is in the Head," 1951

Tuesday, February 5, 2008

authorial intent: there will be blood

A puzzling, but compelling film.

One particularly interesting aspect is the brothers Paul and Eli Sunday, both played by Paul Dano. Although the Wikipedia entry has since been altered (?), mere days ago it contained a section (4.1, if memory serves) on the Paul / Eli Sunday debate (see, for example, here), one of the most interesting sections of the entry, bizarrely removed by busybody Jim Dunning. The issue is this: are they twin brothers, or a single (schizophrenic?) person?

In favor of the former interpretation is Paul Thomas Anderson's intent - according to the now missing section of the Wikipedia article, the decision to cast Eli Sunday with Paul Dano was made after filming had already commenced, and Dano's part as Paul Sunday had already been filmed. Purportedly, a younger actor playing Eli was replaced mid-filming (possibly due to Daniel Day-Lewis's intimidating practice of staying perpetually in character). The brothers interpretation is also supported by the script.

However, the two characters never appear in the same scene, and Eli exhibits schizophrenic behavior in other scenes (he is, after all, a revivalist preacher who "casts the devil" out of his sick parishioners). When Daniel Plainview (Lewis) taunts Eli by telling him how much he paid Paul Sunday for information about their farm, giving a falsified amount, he almost seems to be daring Eli / Paul to admit his deceit, to break free from his schizophrenia. (Also, note that Paul questions Lewis on his religion before Eli even comes into the picture.)

The question, though, is not what interpretation we want to give here, but rather, what interpretation are we permitted to give - may we consider the author "dead" even as he walks among us, decrying our attempts at interpretation? Of course, most of you reading this will give the quick and obvious answer - but will you all give the same one?

Sunday, February 3, 2008

Friday, February 1, 2008

information terrorism

Knowledge has always meant power, but knowledge must be combined with raw materials to realize that power. Knowledge of how to build a catapult is useless without the resources to both build a catapult and transport it to your enemy's doorstep.

Throughout history, tyranny has prevailed over enlightenment due to its possession of sufficient resources, not just to realize the fruits of knowledge, but, perhaps more importantly, to control the flow of knowledge.

Such a system is self-perpetuating and induces the illusion that it is the only viable, or only rational, solution to the problem of coordinating activity amongst a heterogeneous population.

In the age of the internet and the linked-in computer, however, both the resources needed to realize knowledge and those needed to transport it to one's enemy's door have diminished exponentially. The ubiquity of the computer as the means of organizing information has ensured that everyone is vulnerable to attack. The catapult can be realized with a few lines of code, and the internet offers a highway to transport it to your door.

Although the individual is put at risk, so much more so are those authoritarian organizations which had heretofore suppressed knowledge through their control of resources, both labor and materials, and, perhaps more importantly, their control of the flow of knowledge. In the modern age, all information is virus, permeating the internet, infecting those who are linked in.

It is not only political bodies that have abused this power and now find themselves threatened. Consider, for example, ANONYMOUS' threat against Scientology:

[embedded video]

ANONYMOUS is (supposedly) a cabal of secret "superhackers" (though they themselves claim to include divers members of society, from lawyers to "veterinary technicians"). They threaten Scientology with vigilante justice of some kind. So far, the success of related groups has been limited; however, the response by Scientology is just the typical move of the authoritarian ~ throw more resources (money) at the problem, in this case, primarily for purchasing the knowledge of others.

Yet, if the trend continues, the ability of authoritarian bodies to combat knowledge with material resources will dwindle to naught, and we may finally enter the age of the philosopher kings.