It could be called POWDERKEG! Each week, I’ll be arrested without my rights being read to me and held for 14 hours while police refuse to tell me what charges I’m being held on. Meanwhile, the kumbaya squad will talk politics nonstop to see if they can make my head explode.
Responding to my earlier comments about science being easier at a distance, both Nude Cybot and Justin Mason have offered up substantial and useful comments on the subjects of biological taxonomies. (Justin’s have moved to email.)
“Classification in Biology, or phylogenetics, is fraught with issues that we typically do not face when creating our own systems of classification such as organization of content content on a website.” Is actually the exact opposite of my starting position as I learn about these. I thought that the ‘underlying realities’ of biology, that this descended from that, or in chemistry, there are this many electrons in a shell, lead to ‘natural taxnomies.’ Boy, was I ever wrong. (The periodic table can be read as a taxonomy, and the position of atoms in it predicts certain characteristics of those atoms. For example, the ‘noble gasses’ are off to the far right, and their electron shells are filled.)
It turns out that even with such natural divisions, there are many good ways to classify the kingdoms of nature. Ironically, Nudecybot points to Six Degrees: The Science of a Connected Age as a possible answer. Six degrees is, of course, a reference to a classic Milgram study that I wrote about a few days ago, saying that Milgram was better at the experiments than at the theories around them.
So, there’s no perfect taxonomy, only the question of is a taxonomy useful for the purpose at hand. And the purpose at hand needs a tighter definition than it has today.
If you ever saw Julia Child or Jacques Pepin take apart a chicken, you’ll remember how easy they made it look. It’s a level of skill that we can all aspire to.
Watching Ed Hasbrouck take apart the latest incarnation of free wheelchairs for paraplegic children is like watching Julia Child take apart a chicken. He does it so well that you don’t even stop to marvel at his skill. Go read what he has to say about the utter lack of sense and lack of legal standing that the TSA has to be implementing these programs.
In Wikipedia vs. Britannica Smackdown, Ed Felten takes my challenge. In the meanwhile, I’d done some hypothesizing, here.
So how’d I do?
Hypothesis 1 is spot on.
#2 is more challenging to assess: The errors in Britannica are smaller, and I think I’ll judge myself wrong.
#3 I think is accurate, if only because of the long entry on Microsoft.
#4 Ed did not assess, or comment on.
#5 Ed didn’t check Encarta.
So, I think I get 2 out of 3 for the tested hypothesis.
A few days ago, I challenged Ed Felten to do some more comparison work. In the spirit of Milgram, I didn’t propose a theory. (This was mostly because I was trying to make a good joke about assigning the professor homework, but couldn’t come up with one.) However, on consideration, I think that I should propose some theories, and also not influence the experiment.
So, hypothesis 1:
Wikipedia will have 30-50% more entry coverage than the others.
In particular, I don’t expect Ed Felten will have an entry, and I
expect one of his two computer science entries to not be in each
The quality of Wikipedia, measured by errors detected, will meet
that of the others.
Building a large encyclopedia is a lot of work, and I don’t expect that the quality assurance and fact checking will be great anywhere.
The quality of Wikipedia, measured by the depth of the entries,
will be substantially greater than the comparison.
Techies aren’t noted for brevity and conciseness, and the web doesn’t
have physical constraints holding down the size of the entries,
whereas each DVD you ship may add $2 to the cost of a product. I
expect that the difference would be largest against the print or CD
The quality of Wikipedia, as measured by the accessability of
entries, will be lower.
By accessability, I mean how good the
basic introduction and contextualization are, and how well the entry
takes you from no knowledge to some.
Ed will believe that Encarta’s entry on the Microsoft trial is
biased towards Microsoft.
An encyclopedia must be measured first on accuracy, and secondly on
breadth. A roomful of monkeys writing entries does not get you a
useful encyclopedia, but neither does one with one entry. (There are
a great many useful topical encyclopedias which address this by
constraining themselves to one subject.
I expect that Wikipedia’s accuracy will be roughly that of the others,
and it will win, hands down, on breadth and depth. However, this test
is biased by the selection of terms, where they are known to a
computer science professor. If my hypotheses pan out, it would be
fascinating to see if we could recruit from across the Princeton
faculty, to see if the same tests hold true across wider disciplines.
As part of a larger project on security configuration issues, I’m doing a lot of learning about taxonomies and typographies right now. (A taxonomy is a hierarchical typography.)
I am often jealous of the world of biology, where there are underlying realities that can be used for categorization purposes. (A taxonomy needs a decision tree. Any trained person using this tree should classify the same items the same way.)
A new type of shark has recently been discovered, in the Sea Star Aquarium, in Coburg, Germany. This is (at least) the second zoo that the shark has been in.
We are not embarrassed,” said [Schonbrunn Zoo] spokesman Dr Ekkehard Wolf. “We get thousands of exotic animals every year. It is not possible to categorize them all. (From The Telegraph.)
See a picture (and read the article) at Unterwasser.de or read Google’s translation
Even the lucky biologists run into difficulty classifying their species. I feel better trying to classify minimum time between password changes.
In Educated Guesswork, Eric Rescorla writes about one way tickets and the search criteria.
The CAPPS program was created by Northwest airlines, who set the criteria for inclusion. They included one way tickets to enforce their bizarre pricing schemes. This is the same reason they started asking for ID: to cut down on the resale of the other half of a round-trip ticket, which cost the same as a one-way.
CAPPS, incidentally, has been renamed the “free wheelchairs for paraplegic children” program, to make it harder to argue against, and to get around a congressional mandate that the program not be deployed until someone actually thinks it through.
In his comment, Kevin Dick gets it mostly right–there are other items that you want to keep off the planes (pepper spray, for example), but the right technique in a free society involves enabling passengers to fight for their lives, and fortifying the flight deck. There’s a lot that could be done that hasn’t been. For example, consider an “airlock” system, with two doors at the front of the plane, with a restroom inside. The doors open one at a time. There may be an air marshall inside. (A curtain prevents anyone from seeing.) Now hijackers need to get through two doors. They can’t storm the cockpit while the pilots are being fed or using the restroom. This is very expensive. It would require a new bathroom for the high-revenue business travelers up front. It makes a section of plane unusable for reveune generation. But it is entirely free of civil liberties implications for fliers.
Over at Freedom To Tinker, Ed Felten writes about the Wikipedia quality debate.
He takes a sampling of six entries where he’s competent to judge their quality, and assesses them. Two were excellent, one was slightly inaccurate, two were more in depth, but perhaps less accessible than a standard encyclopedia, and one (on the US Microsoft anti-trust case) was error-prone.
Ed writes: “Until I read the Microsoft-case page, I was ready to declare Wikipedia a clear success.” However, I think his experiment is only one-third to one-half done. I think that Ed ought to look up the same 6 entries in another encyclopedia or two, and report back. I’d suggest the Britannica, which is usually considered the gold standard, and perhaps Microsoft’s Encarta, which may be the most widely used.
I can’t do this experiment the way Ed can, because firstly, I don’t have an EB account, and second, because I don’t know all the topics to the depth he does. I could pretend, and perhaps miss errors that he’d catch, or sample six other articles, and perhaps I will over the weekend.
Over at TaoSecurity, Richard writes:
Remember that one of the best ways to prevent intrusions is to help put criminals behind bars by collecting evidence and supporting the prosecution of offenders. The only way to ensure a specific Internet-based threat never bothers your organization is to separate him from his keyboard!
Firstly, I’m very glad that the second, qualifying sentence is there. It provides some context. However, I’m not sure that I care that a specific threat stops, what I care about is that the class of threats go away.
If the odds that a specific criminal hacker goes to jail are low, then the penalties need to be exceptionally severe and well publicised to create a deterrent effect. (This is roughly a criminal attack loss expectancy, which someone smart has done work on.)
We can see that the odds that an attacker goes to jail are relatively small because there is clearly a large attacker population, and very few criminal sentencings. I’m curious how many attacker convictions we’d need each year to change the economics of this and deter 15 year olds from bringing down CNN?