This Saturday was International Open Data Day. Over a hundred cities worldwide hosted events in which applications were built, data was liberated, and the open data cause was promoted. The interactive IODD map gives some sense of the scale of the event.
Visualizing.org launched the Global Development Sprint in collaboration with the World Bank on Open Data Day. This is “a collaborative data visualization project where anyone can fork code to visualize the same open data set”, aiming to demonstrate the benefit of open data policies.
Probabilistic programming, as Rob Zinkov explains, is a powerful and intuitive way of specifying complex models. You can learn this paradigm with “Probabilistic Programming and Bayesian Methods for Hackers”, a Github-hosted ebook that provides a hands-on introduction to Bayesian inference in Python and PyMC.
Syracuse University’s Jeffrey Stanton has made his “Introduction to Data Science”, an interactive textbook for a non-technical audience, available for free. Syracuse will also offer a free and open online course, “A Brief Introduction to Data Science with R”, later this month.
DataFreeze allows you to script static JSON and CSV exports of relational databases for use in data-driven apps. As its creator explains, this is a handy solution to the problem posed by the cumbersome datastores used by high-volume websites.
Videos of over 300 natural language processing lectures have been posted to Vimeo by Chris Callison-Burch. Everyone who works with text—that is, just about everyone!—should check out this collection of cutting-edge research presentations.