Big Data Analytics and Science
14.2K views | +3 today
Follow
 
Scooped by Dahl Winters
onto Big Data Analytics and Science
Scoop.it!

Big Data That’s Good for the Public | MIT Sloan Management Review

Big Data That’s Good for the Public | MIT Sloan Management Review | Big Data Analytics and Science | Scoop.it
The promise of DOPA: Linked information about markets, trends, competitors, products and consumers. (Big Data That’s Good for the Public: Image courtesy of Flickr user cszar. Facts: 900 million.
more...
No comment yet.
The latest in what you need to know to handle and explore big data.
Curated by Dahl Winters
Your new post is loading...
Your new post is loading...
Scooped by Dahl Winters
Scoop.it!

AI Is Transforming Google Search. The Rest of the Web Is Next

AI Is Transforming Google Search. The Rest of the Web Is Next | Big Data Analytics and Science | Scoop.it
As Google's head of artificial intelligence takes charge of search, deep learning is already changing the way Googling works.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Microsoft releases deep learning toolkit on GitHub. Now bring on the AI research

Microsoft releases deep learning toolkit on GitHub. Now bring on the AI research | Big Data Analytics and Science | Scoop.it

Computational Network Toolkit finally gets open source licence

more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Empirically-Based Approach to Understanding the Structure of Data Science

Empirically-Based Approach to Understanding the Structure of Data Science | Big Data Analytics and Science | Scoop.it
The study finds evidence that data science skills fall into three broad areas.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Best practices in HDFS authorization with Apache Ranger

Best practices in HDFS authorization with Apache Ranger | Big Data Analytics and Science | Scoop.it
Learn more about best practices in HDFS authorization with Apache Ranger
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

eBay’s new Pulsar framework will analyze your data in real time

eBay’s new Pulsar framework will analyze your data in real time | Big Data Analytics and Science | Scoop.it
eBay has a new open-source, real-time analytics and stream-processing framework called Pulsar that the company claims is in production and is available for others to download, according to an eBay blog post on Monday. The online auction site is now using Pulsar to gather and process all the data pertaining to user interactions and their…
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

BlazeGraph - Open-Source Scalable Graph Database

BlazeGraph - Open-Source Scalable Graph Database | Big Data Analytics and Science | Scoop.it

SYSTAP is very pleased to launch it’s new graph database platform Blazegraph™. It is built on the same open source GPLv2 platform and maintains 100% binary and API compatibility with Bigdata®. Blazegraph™ will take over as SYSTAP’s flagship graph database. It is specifically designed to support big graphs offering both Semantic Web (RDF/SPARQL) and Graph Database (tinkerpop, blueprints, vertex-centric) APIs. It features robust, scalable, fault-tolerant, enterprise-class storage and query and high-availability with online backup, failover and self-healing. It is in production use with enterprises such as Autodesk, EMC, Yahoo7!, and many others. Blazegraph™ provides both embedded and standalone modes of operation.

Blazegraph has a High Availability and Scale Out architecture. It provides robust support for Semantic Web (RDF/SPARQ)L and Property Graph (Tinkerpop) APIs. Highly scalable Blazegraph graph can handle 50 Billion edges on a single node.

more...
vitonzhang's curator insight, May 4, 2016 12:05 AM
Share your insight
Scooped by Dahl Winters
Scoop.it!

Data Science with Apache Hadoop: Predicting Airline Delays

Data Science with Apache Hadoop: Predicting Airline Delays | Big Data Analytics and Science | Scoop.it
In this multi-part blog post, we will demonstrate Machine Learning techniques using existing modeling tools on Apache Hadoop. Part 1 uses Pig and Python.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Inside the Apache Software Foundation's newest Top-Level Project: Apache Flink

Inside the Apache Software Foundation's newest Top-Level Project: Apache Flink | Big Data Analytics and Science | Scoop.it
Flink contributors talk Big Data processing, open-source community and the future of the newly minted TLP

 

Flink is an open-source Big Data system that fuses processing and analysis of both batch and streaming data. The data-processing engine, which offers APIs in Java and Scala as well as specialized APIs for graph processing, is presented as an alternative to Hadoop’s MapReduce component with its own runtime. Yet the system still provides access to Hadoop’s distributed file system and YARN resource manager.

more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

FAIR open sources deep-learning modules for Torch

FAIR open sources deep-learning modules for Torch | Big Data Analytics and Science | Scoop.it
The modules are significantly faster than the default ones in Torch and have accelerated research projects by allowing users to train larger neural nets in less time.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Big Data vs. Cancer: Algorithm Identifies Genetic Changes across Cancers

Big Data vs. Cancer: Algorithm Identifies Genetic Changes across Cancers | Big Data Analytics and Science | Scoop.it
Using a computer algorithm that can sift through mounds of genetic data, researchers from Brown University have identified several networks of genes that, when hit by a mutation, could play a role in the development of multiple types of cancer.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

IBM detects skin cancer more quickly with visual machine learning - Computerworld

IBM detects skin cancer more quickly with visual machine learning - Computerworld | Big Data Analytics and Science | Scoop.it
Skin cancer can be detected more quickly and accurately by using cognitive computing-based visual analytics, researchers at IBM Research have found, in collaboration with New York's Memorial Sloan Kettering Cancer Center.
more...
No comment yet.
Rescooped by Dahl Winters from Docker
Scoop.it!

A Docker Image for Graph Analytics on Neo4j with Apache Spark GraphX

A Docker Image for Graph Analytics on Neo4j with Apache Spark GraphX | Big Data Analytics and Science | Scoop.it
This docker image is a great addition to Neo4j if you're looking to do easy PageRank or community detection on your graph data. Additionally, the results of the graph analysis are applied back to Neo4j.

Via Docker
more...
No comment yet.
Rescooped by Dahl Winters from Amazing Science
Scoop.it!

Discovering the Undiscovered - DOE Joint Genome Institute

Discovering the Undiscovered - DOE Joint Genome Institute | Big Data Analytics and Science | Scoop.it
Advancing New Tools to Fill in the Microbial Tree of Life To paraphrase a famous passage from Coleridge’s “The Rime of the Ancient Mariner”: microbes, microbes everywhere, though most we do not know. This is changing, though.

Via Dr. Stefan Gruenwald
Dahl Winters's insight:

A big use of big data - exploring the genomes of life on Earth.  One of the biggest data sets in the world is the one we carry around with us and on us every day.

more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

The Neural Network That Remembers

The Neural Network That Remembers | Big Data Analytics and Science | Scoop.it
With short-term memory, recurrent neural networks gain some amazing abilities
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Session with Yoshua Bengio / Jan 19, 2016 - Quora

Session with Yoshua Bengio / Jan 19, 2016 - Quora | Big Data Analytics and Science | Scoop.it
Dahl Winters's insight:

Yoshua Bengio, one of the most influential people in deep learning, will be hosting a live Q&A on Quora tomorrow (January 19) at 4 PM.  You may need to have a Quora account to post a question for him to answer.

 

This is a huge opportunity if you are at all interested in deep learning, one of the most transformative methods in big data analytics today.

more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Striking black gold with big data

Striking black gold with big data | Big Data Analytics and Science | Scoop.it
A global oil and gas major re-imagined oil exploration operations with predictive lithology supported by fuzzy logic and big data, to improve prediction accuracy, enhance scalability and reduce costs.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Interactive Analytics on Dynamic Big Data in Python using Kudu, Impala, and Ibis - Cloudera Engineering Blog

Interactive Analytics on Dynamic Big Data in Python using Kudu, Impala, and Ibis - Cloudera Engineering Blog | Big Data Analytics and Science | Scoop.it
The following post was originally published in the Ibis project blog. (Ibis is a data analysis framework incubating in Cloudera Labs that brings Apache Hadoop scale to Python development.)
The new Apache Kudu (incubating) columnar storage engine together with Apache Impala (incubating) interactive SQL engine enable a new fully open source big data architecture for data that is arriving and changing very quickly. By integrating Kudu and Impala with Ibis,  Read More
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Docker and Mesos: Like peanut butter and jelly

Docker and Mesos: Like peanut butter and jelly | Big Data Analytics and Science | Scoop.it
You want to run Docker containers, but how do you do so at hyper scale? Apache Mesos may be the answer. Matt Asay explains.
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Great list of resources: data science, visualization, machine learning, big data

Great list of resources: data science, visualization, machine learning, big data | Big Data Analytics and Science | Scoop.it
Fantastic resource created by Andrea Motosi. I've only included the 5 categories that are the most relevant to our audience, though it has 31 categories total,…
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Lockheed Martin Releases Open-Source GUI for Real-Time Apache Storm Data Processing

Lockheed Martin Releases Open-Source GUI for Real-Time Apache Storm Data Processing | Big Data Analytics and Science | Scoop.it

StreamFlow™ is a stream processing tool designed to rapidly build and monitor processing workflows. The ultimate goal of StreamFlow is to make working with stream processing frameworks such as Apache Storm easier, faster, and with "enterprise" like management functionality.

StreamFlow provides a graphical user interface for non-developers such as data scientists, analysts, or operational users to rapidly build scalable data flows and analytics.

more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Image Classification with Convolutional Neural Networks – my attempt at the NDSB Kaggle Competition

Image Classification with Convolutional Neural Networks – my attempt at the NDSB Kaggle Competition | Big Data Analytics and Science | Scoop.it

On December 15th, Kaggle started the National Data Science Bowl competition (which runs till the end of March 2015). The competition consists of classifying images of ocean plankton in 121 different classes, with a supplied training set of around 30,000 labeled images, and a test set of 130,000 for which you have to provide the classification. The images are black and white, and in different sizes and shapes, with width and heights ranges roughly between 30 pixels and over 200 pixels. This is a real-world problem to tackle, while also providing through the leaderboard an ability to track your progress, as well as how you do compared to others.

Dahl Winters's insight:

A good overview of getting started fast with deep learning on a real-world problem.

more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

The Dark Corners of Our DNA Hold Clues about Disease

The Dark Corners of Our DNA Hold Clues about Disease | Big Data Analytics and Science | Scoop.it
A “deep-learning” algorithm shines a light on mutations in once obscure areas of the genome
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

How Big Data, Business Intelligence and Analytics Are Fueling Mobile Application Development

How Big Data, Business Intelligence and Analytics Are Fueling Mobile Application Development | Big Data Analytics and Science | Scoop.it
You can’t separate successful mobile application development from either data or analytics.
more...
No comment yet.
Rescooped by Dahl Winters from Frontiers of Journalism
Scoop.it!

Legislative Explorer - data patterns of lawmaking

Legislative Explorer - data patterns of lawmaking | Big Data Analytics and Science | Scoop.it
Interactive visualization that allows anyone to explore actual patterns of lawmaking in Congress

Via M. Edward (Ed) Borasky
more...
No comment yet.
Scooped by Dahl Winters
Scoop.it!

Big data scours public records to predict crime

New software allowing for predictive policing may be coming to a police department near you. Beware, made by telecommunications company Intrado, searches billions of records to find and predict...
Dahl Winters's insight:

Only 6 minutes about an innovative use of big data.

more...
No comment yet.