Is it just me, or has Randall been invoking the Bayes a lot lately?
statismensch wrote:I think this is a misunderstanding of Bayes rule. Let's consider the corrected version of the equation in the comic,
P(near ocean | seashell) ~ p(seashell | near ocean) * P(near ocean),
where "~" means proportional to. (I'm treating "seashell", the known quantity, as constant, and "near ocean", the unknown quantity, as stochastic.) If you're likely to be far away from the ocean to begin with, then P(near ocean | seashell) will be controlled mostly by your low prior probability, P(near ocean). If you're extremely unlikely to be near the ocean to begin with, then it won't make a difference whether you pick up the shell or not. P(near ocean | seashell) will still be low because your prior belief of being far away is so strong.
Wait, where did P(seashell) go? You can't just drop it, it depends on P(near ocean)! I think we can agree that everyone sees about the same number of non-ocean seashells, at children's museums and stuff. However, people who live near (seashell-strewn) beaches see a
lot more ocean seashells than people who don't, so their
total number of seashells is much higher, and for them any given seashell is very likely to be an ocean seashell. I think this is true of Randall, since he lives in Massachusetts and seems to like beaches.
Your argument
does make sense for people who mostly (or always) see non-ocean seashells. I personally encounter shells big enough to listen to so rarely that I'm not sure I can weigh in, though I think one of my elementary school teachers had one and my school wasn't on a beach, so I guess I was in the "mostly non-ocean seashells" group while I was in her class.
statismensch wrote:Doctors have a similar problem testing for rare diseases. Let's say I test positive for the Siberian Swazi-flu (a rare condition that I'm making up). My posterior probability of having the disease is
P(sick | +) ~ P(+ | sick) P(sick)
Since the disease is so rare, P(sick) is extremely low, and so the chances of my having the disease are still low even though I tested positive. Epidemiologists think about this sort of thing a lot.
You could apply similar logic here if testing positive is
also extremely rare, and P(+ | sick) is reasonably sized. You can make up for even a small P(+ | sick) if P(+) is small enough, but such a test would not be very useful

. Of course, practically speaking I bet it's really hard to develop a good test for diseases that are seen rarely, just because you don't have many opportunities to study it...?
greymatters wrote:Better title for this comic: "Bays" Theorem
[sniff] God bless you, sir...god bless you.