Bits 'n Pieces on Big Data
1.3K views | +0 today
Follow
Bits 'n Pieces on Big Data
Innovative information and insight into Big Data (if you like the content, please consider donating to my bitcoin address #3Pjof6N9xRAYXXSPZ4EAFLfHGn51ZdPcxi)
Curated by onur savas
Your new post is loading...
Your new post is loading...
Rescooped by onur savas from Big Data and NoSQL Daily
Scoop.it!

Apache Spark for Big Analytics

Apache Spark for Big Analytics | Bits 'n Pieces on Big Data | Scoop.it

Via Simon Hunanyan
more...
Simon Hunanyan's curator insight, December 23, 2013 10:09 PM

Spark, an Apache incubator project, is an open source distributed computing framework for advanced analytics in Hadoop. It's 100X faster than what they are able to achieve with MapReduce. Spark includes a machine learning library (MLLib), a graph engine (GraphX), a streaming analytics engine (Spark Streaming) and much more...

Currently, Spark supports programming interfaces for Scala, Java and Python.  The R interface is under development and this is expected to be released in the first half of 2014.

Scooped by onur savas
Scoop.it!

It Is Trivially Easy to Match Metadata to Real People

It Is Trivially Easy to Match Metadata to Real People | Bits 'n Pieces on Big Data | Scoop.it

"...We randomly sampled 5,000 numbers from our crowdsourced MetaPhone dataset and queried the Yelp, Google Places, and Facebook directories. With little marginal effort and just those three sources—all free and public—we matched 1,356 (27.1%) of the numbers. Specifically, there were 378 hits (7.6%) on Yelp, 684 (13.7%) on Google Places, and 618 (12.3%) on Facebook..."

more...
No comment yet.
Scooped by onur savas
Scoop.it!

Manifact - Real-time Metrics - Statistics - Alerts

Manifact - Real-time Metrics - Statistics - Alerts | Bits 'n Pieces on Big Data | Scoop.it

Real-time Metrics, Statistics and Alerts. Works with MySql, Oracle, PostgreSql, and Hadoop.

more...
No comment yet.
Scooped by onur savas
Scoop.it!

Big Data – Lessons from Genetics and Bio-Statistics

Big Data – Lessons from Genetics and Bio-Statistics | Bits 'n Pieces on Big Data | Scoop.it
The chimpanzee genome is more than 95 % identical to the human genome – by Roopam

Evolution of Biology & Medicine with Big Data
I learnt a couple…
more...
No comment yet.
Scooped by onur savas
Scoop.it!

Twitter buzz about papers does not mean citations later

Twitter buzz about papers does not mean citations later | Bits 'n Pieces on Big Data | Scoop.it
Analysis of science on social media service finds little correlation with standard measures of academic success.
more...
No comment yet.
Scooped by onur savas
Scoop.it!

Netflix open sources its data traffic cop, Suro

Netflix open sources its data traffic cop, Suro | Bits 'n Pieces on Big Data | Scoop.it
Netflix has open sourced a tool called Suro that collects event data from disparate application servers before sending them to other data platforms such as Hadoop and Elasticsearch.
more...
No comment yet.
Scooped by onur savas
Scoop.it!

Self-Replicating USBs Spread Software Faster than an Internet Connection | MIT Technology Review

Self-Replicating USBs Spread Software Faster than an Internet Connection | MIT Technology Review | Bits 'n Pieces on Big Data | Scoop.it
Downloading free software is hugely time consuming and expensive in the developing world. Now one computer scientist has worked out how to spread it faster and more cheaply without using the internet.
more...
No comment yet.
Scooped by onur savas
Scoop.it!

YouTube Multiview Video Games Dataset

YouTube Multiview Video Games Dataset | Bits 'n Pieces on Big Data | Scoop.it

This dataset contains about 120k instances, each described by 13 feature types, with class information, specially useful for exploring multiview topics (cotraining, ensembles, clustering,..).

more...
No comment yet.
Rescooped by onur savas from Talks
Scoop.it!

What Big Data Means For Social Science

We've known big data has had big impacts in business, and in lots of prediction tasks. I want to understand, what does big data mean for what we do for science? Specifically, I want to think about the following context:  You have a scientist who has a hypothesis that they would like to test, and I want to think about how the testing of that hypothesis might change as data gets bigger and bigger. So that's going to be the rule of the game. Scientists start with a hypothesis and they want to test it; what's going to happen?

 


Via Alessandro Cerboni, NESS, Complexity Digest
more...
Scooped by onur savas
Scoop.it!

Governing Algorithms: A Provocation Piece

Governing Algorithms: A Provocation Piece | Bits 'n Pieces on Big Data | Scoop.it

"Algorithms have developed into somewhat of a modern myth. They “compet[e] for our living rooms” (Slavin 2011), “determine how a billion plus people get where they’re going” (McGee 2011), “have already written symphonies as moving as those composed by Beethoven” (Steiner 2012), and “free us from sorting through multitudes of irrelevant results” (Spring 2011). Nevertheless, the nature and implications of such orderings are far from clear. What exactly is it that algorithms “do”? What is the role attributed to “algorithms” in these arguments? How can we turn the “problem of algorithms” into an object of productive inquiry? This paper sets out to trouble the coherence of the algorithm as an analytical category and explores its recent rise in scholarship, policy, and practice through a series of provocations.

onur savas's insight:

You can download the paper form the very link.

more...
No comment yet.
Scooped by onur savas
Scoop.it!

How Big Data Can Transform Society for the Better

How Big Data Can Transform Society for the Better | Bits 'n Pieces on Big Data | Scoop.it
The digital traces we leave behind each day reveal more about us than we know. This could become a privacy nightmare—or it could be the foundation of a healthier, more prosperous world
onur savas's insight:

By M.I.T. computer scientist Alex Pentland. Video based on the article: http://www.scientificamerican.com/article.cfm?id=pentland-how-exactly-is-big-data-going-to-change-the-world-video


more...
No comment yet.
Scooped by onur savas
Scoop.it!

New data science institute at UC Berkeley to help scholars harness ‘big data’

New data science institute at UC Berkeley to help scholars harness ‘big data’ | Bits 'n Pieces on Big Data | Scoop.it
The Moore and Sloan Foundations announced on Nov. 12 a new data science initiative to advance data-driven scholarship in the social and physical sciences.
more...
No comment yet.
Rescooped by onur savas from Big Data Security Analytics
Scoop.it!

Solving NetFlow analysis challenges with big data approach

Solving NetFlow analysis challenges with big data approach | Bits 'n Pieces on Big Data | Scoop.it
Admins looking for more efficient NetFlow analysis can find network management tools that offer finer detail and higher performance without relying on relational database platforms.

Via cysap
more...
No comment yet.
Scooped by onur savas
Scoop.it!

The Predictive Power of Big Data

The Predictive Power of Big Data | Bits 'n Pieces on Big Data | Scoop.it
If you wrote out all five zettabytes of data produced by humans produce each year, it would reach the galactic core of the Milky Way.
more...
No comment yet.
Scooped by onur savas
Scoop.it!

Schedule: All sessions: Strata 2014 - O'Reilly Conferences, February 11 - 13, 2014, Santa Clara, CA

Schedule: All sessions: Strata 2014 - O'Reilly Conferences, February 11 - 13, 2014, Santa Clara, CA | Bits 'n Pieces on Big Data | Scoop.it
more...
No comment yet.
Scooped by onur savas
Scoop.it!

The Cloud, Visualizations and Apps: Wikibon’s Big Data Predictions for 2014 | The Big Data Hub

The Cloud, Visualizations and Apps: Wikibon’s Big Data Predictions for 2014 | The Big Data Hub | Bits 'n Pieces on Big Data | Scoop.it
The Cloud, Visualizations and Apps: Wikibon’s Big Data Predictions for 2014
more...
No comment yet.
Scooped by onur savas
Scoop.it!

Big Data Social Science Research Program at Penn State

Big Data Social Science Research Program at Penn State | Bits 'n Pieces on Big Data | Scoop.it

 Integrative Graduate Education and Research Traineeship at Penn State.

more...
No comment yet.
Scooped by onur savas
Scoop.it!

Project ranks billions of drug interactions

Project ranks billions of drug interactions | Bits 'n Pieces on Big Data | Scoop.it

“It’s the largest computational docking ever done by mankind.”


By analysing the chemical structure of a drug, researchers can see if it is likely to bind to, or ‘dock’ with, a biological target such as a protein. Researchers have now unveiled a computational effort that used Google's supercomputers to assesses billions of potential dockings on the basis of drug and protein information held in public databases, finding potentially toxic side effects and allowing researchers to predict how and where a compound might work in the body.

more...
No comment yet.
Scooped by onur savas
Scoop.it!

IBM Data Magazine

IBM Data Magazine | Bits 'n Pieces on Big Data | Scoop.it
Clear, in-depth technical advice and hands-on examples on the latest topics in data management, IBM databases, and big data.
more...
No comment yet.
Scooped by onur savas
Scoop.it!

SCAPE - SCAlable Preservation Environments

SCAPE - SCAlable Preservation Environments | Bits 'n Pieces on Big Data | Scoop.it

The SCAPE project will develop scalable services for planning and execution of institutional preservation strategies on an open source platform that orchestrates semi-automated workflows for large-scale, heterogeneous collections of complex digital objects. SCAPE will enhance the state of the art of digital preservation in three ways: by developing infrastructure and tools for scalable preservation actions; by providing a framework for automated, quality-assured preservation workflows and by integrating these components with a policy-based preservation planning and watch system. These concrete project results will be validated within three large-scale Testbeds from diverse application areas.

onur savas's insight:

SCAPE Public Wiki: http://wiki.opf-labs.org/display/SP/Home

 

Also, upcoming "Hadoop Driven Digital Preservation" agenda: http://wiki.opf-labs.org/display/SP/Agenda+-+Hadoop+Driven+Digital+Preservation

more...
No comment yet.
Scooped by onur savas
Scoop.it!

The Future of the Statistical Sciences Workshop

The Future of the Statistical Sciences Workshop | Bits 'n Pieces on Big Data | Scoop.it
Workshop at a Glance: WHAT: Future of the Statistical Sciences Workshop WHEN: November 11-12, 2013 WHERE: Royal Statistical Society Offices, London, England About the Future of the Statistical Sciences Workshop The capstone event of the International...
more...
No comment yet.
Scooped by onur savas
Scoop.it!

Amazon wades into big data streams with Kinesis

Amazon wades into big data streams with Kinesis | Bits 'n Pieces on Big Data | Scoop.it
Service is designed to take on data firehoses like Twitter's in AWS' cloud.
more...
No comment yet.
Scooped by onur savas
Scoop.it!

NASA brings Earth science 'big data' to the cloud with Amazon web services

NASA brings Earth science 'big data' to the cloud with Amazon web services | Bits 'n Pieces on Big Data | Scoop.it
(Phys.org) —NASA and Amazon Web Services Inc. (AWS) of Seattle, Wash., are making a large collection of NASA climate and Earth science satellite data available to research and educational users through the AWS cloud.
more...
No comment yet.