Can Twitter be mined for information on food poisoning outbreaks? One Google data scientist thinks so. Adam Sadilek led a team at the University of Rochester that developed Nemesis , a machine learning system which asks "which restaurants should you avoid today?"
Using a set of keywords, Nemesis mines Twitter for geolocated posts that could be indicative of foodborne illness. In tests, tweets from New York were datamined and had metadata added indicating restaurants within 25 meters that were open at the time the user tweeted. A team of humans recruited via Mechanical Turk then came up with 27 words and phrases indicating food poisoning--things like "My tummy hurts," "stomachache," "throw up," "Mylanta," and "Pepto-Bismol." Nemesis then assigned health scores to the nearby restaurants based on the proportion of food poisoning-inferring tweets.