Data scientists have hundreds of probability distributions from which to choose. Where to start? Data science, whatever it may be, remains a big deal. “A data scientist is better at statistics than any software engineer,” you may overhear a pundit say, at your local tech get-togethers and hackathons. The applied mathematicians have their revenge, because statistics hasn’t been this talked-about since the roaring 20s. They have their own legitimizing Venn diagram of which people don’t make fun. Read More
Welcome to my new column on the new pages of TDAN.com. This quarterly column will address data from a personal perspective and is written to get you to think about the importance of data in our daily lives. I cannot tell you how often I tell people, after lengthy discussions about seemingly unconnected subjects, that […]
I remember the days of Bezdek, fuzzy c-means clustering. My humble team developed algorithms to classify landmines in Angola. We spent a lot of time looking at the data, matrices and vectors before selecting a random sample group. Principal component analysis was another popular method to compress the data to decrease the cost of algorithms. It was not too long ago that I wrote my dissertation on it in 2010.
Tim O’Reilly has been at the cutting edge of the Internet since it went commercial. In fact, he helped take it there: In August 1993 he released the Global Network Navigator, a web page containing information, catalogs and a marketplace, which may have been the first site with advertising.
Today we are announcing MongoDB 3.0. This release marks the beginning of a new phase in which we build on an increasingly mature foundation to deliver a database so powerful, flexible, and easy to manage that it can be the new DBMS standard for any team, in any industry.
You probably know IBM's Watson platform best from its winning performance on Jeopardy. But the supercomputer is more than just a mechanism for IBM to publicly shame smart people. It's arguably the most powerful natural-language supercomputer in the world, and thanks to a new public beta, its number-crunching abilities are open to all.
Organizations ranging from Zipcar to bike-sharing programs rely on remote unlocking and return of assets. Sharing economy companies that rely on individual assets need to do the same to compete on experience and drive member loyalty.
Organize anything, together. Trello is a collaboration tool that organizes your projects into boards. In one glance, know what's being worked on, who's working on what, and where something is in a process.
What is text analytics and how can it be beneficial to my business, skillset, or predictive models? If you’ve searched out this website, it is likely that you are here to learn the how of text analytics. In this case, we will primarily address the how with IBM/SPSS Modeler. But, we will also answer the question of why. Why will be answered several times over in the use case section, but in overview the broader question is “What is text analytics?”
Figure 1: Network Diagram for the period before the falling of Oil pricesBig Data is indeed disrupting our industries and to a data scientist, the best way to prove that and show it to people is to play around with some data!The fall of Oil prices is one of the most prominent topics in our world nowadays, so its a matter of curiosity for any data enthusiast to see what Big Data can tell us about the Oil market's scene. We analyzed hundred of thousands of news article mentioning the Oil & Gas discussions before and after the fall of Oil prices in a 6 months’ time frame. We used the G
From the Domesday Book to modern government papers, the National Archives' collection of more than 11m historical government and public records is one of the world’s largest. It includes paper and parchment…
Sharing your scoops to your social media accounts is a must to distribute your curated content. Not only will it drive traffic and leads through your content, but it will help show your expertise with your followers.
How to integrate my topics' content to my website?
Integrating your curated content to your website or blog will allow you to increase your website visitors’ engagement, boost SEO and acquire new visitors. By redirecting your social media traffic to your website, Scoop.it will also help you generate more qualified traffic and leads from your curation work.
Distributing your curated content through a newsletter is a great way to nurture and engage your email subscribers will developing your traffic and visibility.
Creating engaging newsletters with your curated content is really easy.