Code: Big Data
2.2K views | +0 today
Follow
Code: Big Data
Research and Practice in Big Data Analytics
Curated by Jose Menes
Your new post is loading...
Your new post is loading...
Scooped by Jose Menes
Scoop.it!

Big Data’s Mathematical Mysteries | Quanta Magazine

Big Data’s Mathematical Mysteries |  Quanta Magazine | Code: Big Data | Scoop.it
Machine learning works spectacularly well, but mathematicians aren’t quite sure why.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

[1604.02492] Challenges in Bayesian Adaptive Data Analysis

Traditional statistical analysis requires that the analysis process and data are independent. By contrast, the new field of adaptive data analysis hopes to understand and provide algorithms and accuracy guarantees for research as it is commonly performed in practice, as an iterative process of proposing hypotheses and interacting with the data set. 
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Pedro Domingos’ Master Algorithm: How machine learning is reshaping how we live.

Pedro Domingos’ Master Algorithm: How machine learning is reshaping how we live. | Code: Big Data | Scoop.it
Computers and the algorithms they run are precise, perfect, meticulously programmed, and austere. That’s the idea, anyway.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Why Zipf's law explains so many big data and physics phenomenons

Why Zipf's law explains so many big data and physics phenomenons | Code: Big Data | Scoop.it
The Zipf's law states that in many settings (that we are going to explore), the volume or size of entities is inversely proportional to a power s (s > 0…
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Leaving Data on the Table: Data Scientists Reveal Obstacles to Big Data - insideBIGDATA

Leaving Data on the Table: Data Scientists Reveal Obstacles to Big Data - insideBIGDATA | Code: Big Data | Scoop.it

The huge volume of Big Data produced by sensors, genomic sequencers, electronic exchanges, and connected devices continues to generate headlines but it’s the diverse types of data, not the volume, that’s a bigger challenge to data scientists and is causing them to “leave data on the table.”

more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Building a Better Brain: Saffron Cognitive Computing Platform Replicates How We Associate Facts - insideBIGDATA

Building a Better Brain: Saffron Cognitive Computing Platform Replicates How We Associate Facts - insideBIGDATA | Code: Big Data | Scoop.it
Building a Better Brain: Saffron Cognitive Computing Platform Replicates How We Associate Facts
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

70+ websites to get large data repositories for free « Big Data Made Simple

70+ websites to get large data repositories for free « Big Data Made Simple | Code: Big Data | Scoop.it
Do you require GBs of data to check the performance of your app? The easiest way is to download samples of data from free data repositories available on the Web. But the main disadvantage of this approach is the data will have very less unique content and it may not
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Mode raises $2M and opens 'GitHub for data' to the public

Mode raises $2M and opens 'GitHub for data' to the public | Code: Big Data | Scoop.it
Mode is trying to do for data scientists and analysts what GitHub did for developers by giving them a place where they can find, collaborate and work on data. Formation8 led the new round, which also included Reddit’s Alexis Ohanian.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

A Tour of Machine Learning Algorithms

A Tour of Machine Learning Algorithms | Code: Big Data | Scoop.it
Originally published by Jasonb on MachineLearningMastery.com.
From the Ensemble Methods section
Learning Style
There are different ways an algorithm can model…
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Big Data is the New Engine of the Internet

Big Data is the New Engine of the Internet | Code: Big Data | Scoop.it
Mobile devices and expanding networks of sensors are generating huge amounts of data that are increasingly searchable and can be shared to discern patterns
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Is Python Becoming the King of the Data Science Forest? - Experfy Insights

Is Python Becoming the King of the Data Science Forest? - Experfy Insights | Code: Big Data | Scoop.it
Python is quickly gaining ground over R for data science. We consider the reasons for this shift and whether R has a future within the big data ecosystem.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Twitter to Release All Tweets to Scientists: A Trove of Billions of Tweets Will Be a Research Boon and An Ethical Dilemma

Twitter to Release All Tweets to Scientists: A Trove of Billions of Tweets Will Be a Research Boon and An Ethical Dilemma | Code: Big Data | Scoop.it
A trove of billions of tweets will be a research boon and an ethical dilemma
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

The biggest Big Data project in the universe

The biggest Big Data project in the universe | Code: Big Data | Scoop.it
The biggest amount of data ever gathered and processed passing through the UK, for scientists and SMBs to slice, dice, and turn into innovations and insights. When Big Data becomes Super-Massive Data.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Recycling Deep Learning Models with Transfer Learning

Recycling Deep Learning Models with Transfer Learning | Code: Big Data | Scoop.it
Deep learning exploits gigantic datasets to produce powerful models. But what can we do when our datasets are comparatively small? Transfer learning by fine-tuning deep nets offers a way to leverage existing datasets to perform well on new tasks.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Stream Processing – What Is It and Who Needs It - Data Science Central

Stream Processing – What Is It and Who Needs It - Data Science Central | Code: Big Data | Scoop.it
Summary: Stream Processing and In-Stream Analytics are two rapidly emerging and widely misunderstood data science technologies. In this article we’ll focus o…
more...
No comment yet.
Rescooped by Jose Menes from Digital Transformation of Businesses
Scoop.it!

Beginner's Guide to Eigenvectors, PCA, Covariance & Entropy- maths behind #deepLearning

Beginner's Guide to Eigenvectors, PCA, Covariance & Entropy- maths behind #deepLearning | Code: Big Data | Scoop.it

This post introduces eigenvectors and their relationship to matrices in plain language and without a great deal of math. It builds on those ideas to explain covariance, principal component analysis, and information entropy.

The eigen in eigenvector comes from German, and it means something like “very own.” For example, in German, “mein eigenes Auto” means “my very own car.” So eigen denotes a special relationship between two things. Something particular, characteristic and definitive. This car, or this vector, is mine and not someone else’s.

Matrices, in linear algebra, are simply rectangular arrays of numbers, a collection of scalar values between brackets, like a spreadsheet. All square matrices (e.g. 2 x 2 or 3 x 3) have eigenvectors, and they have a very special relationship with them, a bit like Germans have with their cars.


Via Farid Mheir
more...
Farid Mheir's curator insight, August 21, 2015 6:33 AM

An explanation of some key mathematical concepts required to make deep learning possible.


WHY IS THIS IMPORTANT

Behind what looks like "magic" on Google search, google translate or IBM Watson lies mathematics algorithm that process large amount of data. This is essential information to understand how digital computing, big data and deep learning works.

Scooped by Jose Menes
Scoop.it!

Fast clustering algorithms for massive datasets

Fast clustering algorithms for massive datasets | Code: Big Data | Scoop.it
Here we discuss two potential algorithms that can perform clustering extremely fast, on big data sets, as well as the graphical representation of such complex…
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

A Large set of Machine Learning Resources for Beginners to Mavens

A Large set of Machine Learning Resources for Beginners to Mavens | Code: Big Data | Scoop.it
Note : I regularly update this list. Machine Learning 101: I. Introduction to Machine Learning http://homepages.inf.ed.ac.uk/rbf/IAPR/researchers/MLPAGES/mltut.htm http://jeremykun.com/2012/08/04/machine-learning-introduction/ http://www.omidrouhani.com/research/machinelearning/html/machinelearning.htm http://www.youtube.com/playlist?list=PLD63A284B7615313A (cal tech class) II.  Linear Regression http://en.wikipedia.org/wiki/Linear_regression http://www.youtube.com/watch?v=ExVhaN36jBs http://en.wikipedia.org/wiki/Simple_linear_regression http://www.youtube.com/watch?v=ocGEhiLwDVc     III) Linear Algebra http://ocw.mit.edu/courses/mathematics/18-06sc-linear-algebra-fall-2011/Syllabus/ https://www.khanacademy.org/math/linear-algebra online text http://joshua.smcvt.edu/linearalgebra/book.pdf - see http://joshua.smcvt.edu/linearalgebra/ for usage rights V) Linear Regression with Multiple Variables - Gradient Descent http://en.wikipedia.org/wiki/Gradient_descent http://www.youtube.com/watch?v=umAeJ7LMCfU (discusses above wiki …
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Views from the front lines of the data-analytics revolution | McKinsey & Company

Views from the front lines of the data-analytics revolution | McKinsey & Company | Code: Big Data | Scoop.it
At a unique gathering of data-analytics leaders, new solutions began emerging to vexing privacy, talent, organizational, and frontline-adoption challenges. A McKinsey Quarterly article.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Conjecture: Scalable Machine Learning in Hadoop with Scalding « Code as Craft

Conjecture: Scalable Machine Learning in Hadoop with Scalding « Code as Craft | Code: Big Data | Scoop.it

Predictive machine learning models are an important tool for many aspects of e-commerce.  At Etsy, we use machine learning as a component in a diverse set of critical tasks. For instance, we use predictive machine learning models to estimate click rates of items so that we can present high quality and relevant items to potential buyers on the site.

more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

The Philosophy of Data

The Philosophy of Data | Code: Big Data | Scoop.it
Our ability to gather and process huge amounts of data does many things, including correcting intuitive biases and illuminating patterns of behavior.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

Skymind launches with open-source, plug-and-play deep learning features for your app

Skymind launches with open-source, plug-and-play deep learning features for your app | Code: Big Data | Scoop.it
In Silicon Valley, deep learning ranks as one of the hottest technologies. Now, this startup sees a chance to let lots of developers incorporate deep learning into their apps. Deep learning essenti...
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

10 Big Data Pros To Follow On Twitter - InformationWeek

10 Big Data Pros To Follow On Twitter - InformationWeek | Code: Big Data | Scoop.it
Looking for big data expertise on Twitter? Start by following these 10 industry players.
more...
No comment yet.
Scooped by Jose Menes
Scoop.it!

100+ Interesting Data Sets for Statistics

100+ Interesting Data Sets for Statistics | Code: Big Data | Scoop.it
Looking for interesting data sets? Here's a list of more than 100 of the best stuff, from dolphin relationships to political campaign donations to death row prisoners.
more...
No comment yet.