Big data

Scooped by Jean-Baptiste Poullet

8 SQL-on-Hadoop frameworks worth checking out | Matthew Rathbone

A rundown of the common query engines for Hadoop, with some of their pros/cons

Scooped by Jean-Baptiste Poullet

HBase on low memory cluster

How to configure HBase on low memory cluster. Reduce the number of regions per server. Before getting into math, let's recall briefly what the memstore is in HBase. The memstore holds in-memory modif...

Rescooped by Jean-Baptiste Poullet from MarketingHits

Big Data + Big Ideas = Big Impact #bigdata

Peter Fisk explores the world of big data and its impact on business through innovation and marketing, with particular application to the future of telecoms. F…

Via Brian Yanish - MarketingHits.com

Scooped by Jean-Baptiste Poullet

Big Data: The jungle of different open source distributions...

Today it is hard to find your way through the Hadoop jungle, for the following reasons: - These are young technologies. - There is a lot of buzz and marketing from companies that want to jump on the Big Data bandwagon. - Shortcuts are often... #bigdata #cloudera #hadoop

Scooped by Jean-Baptiste Poullet

Microsoft Hortonworks partnership will “bring big data to billions” | #BigDataSV | SiliconANGLE

Rescooped by Jean-Baptiste Poullet from Digital Delights - Digital Tribes

BigData in HR: Why it's Here and What it Means

The Bersin by Deloitte Analyst Blog

Via Ana Cristina Pratas
Ana Cristina Pratas's curator insight, March 5, 2013 2:14 AM

If you think about the history of analytics in other business areas, the evolution looks like the chart below. When companies started industrializing their manufacturing, they eventually purchased ERP software and developed supply chain and financial analytics.


In the 1970s and 1980s companies started to industrialize their customer marketing and analysis, and we started to focus on "the market of one." This led to a tremendous explosion in CRM and sales analysis systems, which today has become a huge industry in customer segmentation and marketing analytics.


Now, given the global recession and talent imbalances in the world, companies are focusing on replacing their legacy HR systems to help apply analytics reasoning to HR and talent. As the chart shows, in each of these evolutions we started with reporting and core understanding and then moved to predictive analytics. This is what is happening in HR.

Rescooped by Jean-Baptiste Poullet from Cloud Central

Bossie Awards 2014: The best open source big data tools

InfoWorld's top picks in distributed data processing, data analytics, machine learning, NoSQL databases, and the Hadoop ecosystem

Via Peter Azzopardi
Peter Azzopardi's curator insight, November 10, 2014 5:46 PM

This year's Bossies in big data track important new developments in the Hadoop stack, underscore a maturing NoSQL space, and highlight a number of useful tools for data wrangling, data analysis, and machine learning.

Rescooped by Jean-Baptiste Poullet from Scala & Cloud Playing

Node.js vs Play Framework

Here's the showdown you've been waiting for: Node.js vs Play Framework.

Via Wonil Lee Ph.D.

Rescooped by Jean-Baptiste Poullet from BigData NoSql and Data Stuff

Big Data Benchmark - Redshift, Hive, Shark, Impala, Tez

Several analytic frameworks have been announced in the last year. Among them are inexpensive data-warehousing solutions based on traditional Massively Parallel Processor (MPP) architectures (Redshift), systems which impose MPP-like execution engines on top of Hadoop (Impala, HAWQ) and systems which optimize MapReduce to improve performance on analytical workloads (Shark, Stinger/Tez). This benchmark provides quantitative and qualitative comparisons of five systems. It is entirely hosted on EC2 and can be reproduced directly from your computer.

Redshift - a hosted MPP database offered by Amazon.com based on the ParAccel data warehouse. We tested Redshift on HDDs.
Hive - a Hadoop-based data warehousing system. (v0.12)
Shark - a Hive-compatible SQL engine which runs on top of the Spark computing framework. (v0.8.1)
Impala - a Hive-compatible* SQL engine with its own MPP-like execution engine. (v1.2.3)
Stinger/Tez - Tez is a next generation Hadoop execution engine currently in development (v0.2.0)

This remains a work in progress and will evolve to include additional frameworks and new capabilities. We welcome contributions.

What this benchmark is not

This benchmark is not intended to provide a comprehensive overview of the tested platforms. We are aware that by choosing default configurations we have excluded many optimizations. The choice of a simple storage format, compressed SequenceFile, omits optimizations included in columnar formats such as ORCFile and Parquet. For now, we've targeted a simple comparison between these systems with the goal that the results are understandable and reproducible.

What is being evaluated?

This benchmark measures response time on a handful of relational queries: scans, aggregations, joins, and UDFs, across different data sizes. Keep in mind that these systems have very different sets of capabilities. MapReduce-like systems (Shark/Hive) target flexible and large-scale computation, supporting complex User Defined Functions (UDFs), tolerating failures, and scaling to thousands of nodes. Traditional MPP databases are strictly SQL compliant and heavily optimized for relational queries. The workload here is simply one set of queries that most of these systems can complete.
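
The four query classes named above (scans, aggregations, joins, and UDFs) can be illustrated with a minimal sketch. It uses Python's built-in sqlite3 module rather than any of the benchmarked systems, and the table names, columns, sample rows, and the url_length function are all hypothetical, chosen only for illustration; they are not the benchmark's actual schema or queries.

```python
import sqlite3


def run_query_classes():
    """Run one tiny example of each query class on an in-memory SQLite database.

    Everything here (tables, columns, rows, url_length) is made up for
    illustration and is not taken from the benchmark itself.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE pages (url TEXT, page_rank INTEGER);
        CREATE TABLE visits (ip TEXT, url TEXT, revenue REAL);
        INSERT INTO pages VALUES ('a.com', 10), ('b.com', 50), ('c.com', 90);
        INSERT INTO visits VALUES
            ('1.1.1.1', 'a.com', 1.0),
            ('1.1.1.1', 'b.com', 2.0),
            ('2.2.2.2', 'c.com', 4.0);
        """
    )

    # Register a user-defined function (UDF) callable from SQL.
    conn.create_function("url_length", 1, lambda url: len(url))

    results = {}

    # Scan: filter rows of a single table on a predicate.
    results["scan"] = conn.execute(
        "SELECT url FROM pages WHERE page_rank > 40 ORDER BY url"
    ).fetchall()

    # Aggregation: group rows and compute a per-group total.
    results["aggregation"] = conn.execute(
        "SELECT ip, SUM(revenue) FROM visits GROUP BY ip ORDER BY ip"
    ).fetchall()

    # Join: combine the two tables on url, then aggregate per visitor.
    results["join"] = conn.execute(
        "SELECT v.ip, AVG(p.page_rank) FROM visits v "
        "JOIN pages p ON p.url = v.url GROUP BY v.ip ORDER BY v.ip"
    ).fetchall()

    # UDF: apply the registered Python function to every row.
    results["udf"] = conn.execute(
        "SELECT url, url_length(url) FROM pages ORDER BY url"
    ).fetchall()

    conn.close()
    return results


if __name__ == "__main__":
    for name, rows in run_query_classes().items():
        print(name, rows)
```

A single-node embedded database says nothing about distributed performance; the sketch only shows what each class of query looks like.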


Via Alex Kantone

Rescooped by Jean-Baptiste Poullet from Technology in Business Today

A list of Digital Marketing Tools for your Business

Via TechinBiz

Scooped by Jean-Baptiste Poullet

How to use MongoDB with SSL

Data security, encryption and privacy are all over the news these days. Deutsche Telekom is talking about creating "Schlandnet" and HTTP 2.0 is going all in with SSL. In times where everybody is c...

Scooped by Jean-Baptiste Poullet

The evolving HBase ecosystem - hadoopsphere.com

Scooped by Jean-Baptiste Poullet

Gartner Says New Relationships Will Change Business Intelligence and Analytics

Business intelligence (BI) and analytics leaders need to embrace four trends that are set to challenge traditional assumptions about these technology areas, according to Gartner, Inc. (The relentless march of predictive analytics...

Scooped by Jean-Baptiste Poullet

Spark or Hadoop: is it an either-or proposition? By Slim Baltagi

An exodus away from Hadoop to Spark is picking up steam in the news headlines and talks! Away from marketing fluff and politics, this talk analyzes such news a…

Scooped by Jean-Baptiste Poullet

HP adds scale to open-source R in latest big data platform | ZDNet

HP introduced Haven Predictive Analytics, an open-sourced big data platform that it says will bring more machine learning and statistical analysis power to companies dealing with large data volumes.

Rescooped by Jean-Baptiste Poullet from BIG data, Data Mining, Predictive Modeling, Visualization

Inside-BigData.com | Discovering Gold with Big Data Analytics and Data-Intensive Computing

insideBIGDATA: Startup news (Video: Immortality, Big Data, and Tattoos | http://t.co/05gjkII2 http://t.co/OHUhoi5W #BigData...)...

Via AnalyticsInnovations

Scooped by Jean-Baptiste Poullet

In 2015, big data will drive the Internet of things

Connected objects are not smart on their own; they only provide value when connected to a backend (often cloud-based) service. Recent advances in big data technologies enable providers of connected objects to deliver high-value services from and through their objects.

Rescooped by Jean-Baptiste Poullet from Data Nerd's Corner

#BigData100: Vote Now for the Most Influential in Big Data - Saul Sherry | Big Data Republic

Big Data Republic opens campaign to find the most influential big data tweeters.

Via Carla Gentry CSPO
Carla Gentry CSPO's curator insight, March 19, 2013 2:23 PM

Guess who is #6 on one list and #9 on the other - So appreciative to my peers! Thank You!!

https://alist.traackr.com/datascience http://www.bigdatarepublic.com/author.asp?section_id=2642&doc_id=260536

Rescooped by Jean-Baptiste Poullet from Data Nerd's Corner

Using Big Data To Fight Pandemics

People’s efforts have understandably been focused elsewhere. This week at the ITU Plenipotentiary Conference in Busan, the International Telecommunication Union (ITU), the GSMA and the Internet Society (ISOC) announced that they are joining forces in the fight against Ebola. This unity is an essential step forward, but along with the GSMA, United Nations Global Pulse, and a number of other data scientists, I really want to make sure we, and most importantly the African mobile operators, address this opportunity and truly harness the potential of the data available.

Via Carla Gentry CSPO
Carla Gentry CSPO's curator insight, November 11, 2014 3:42 PM

Of course mobile data analytics cannot directly assist the heroic work of doctors and nurses who are on the ground, but it could prove extremely helpful when it comes to planning resource allocation or understanding the effectiveness of different mobility containment measures.

Rescooped by Jean-Baptiste Poullet from Scala & Cloud Playing

Hortonworks Sandbox R and RStudio install

The blogspot blog here tells how to modify a Cloudera VM to include R and RStudio in the VM as well as the RHadoop library. This document shows some modifications to the steps to support the Horton...

Via Wonil Lee Ph.D.

Rescooped by Jean-Baptiste Poullet from Business Brainpower with the Human Touch

BigData in Human Resources: Talent Analytics Comes of Age

There are around 160 million workers in the US alone, and most companies' largest expense is payroll. In fact, in most businesses payroll is 40% or more of total revenue, meaning that total US payroll expense is many billions of dollars.

How well do organizations truly understand what drives performance among their workforce? The answer: not really very well. Do we know why one sales person outperforms his peers? Do we understand why certain leaders thrive and others flame out? Can we accurately predict whether a candidate will really perform well in our organization?


Via The Learning Factor
The Learning Factor's curator insight, February 17, 2013 9:34 PM

A great article on the use of BigData in HR. Data definitely tells us that we may have been looking at the wrong things in relation to a number of HR practices. Take a look at a good example of recruiting sales people.

Scooped by Jean-Baptiste Poullet

Have You Seen Spring Lately? | Pivotal P.O.V.

RT @springcentral: Share this with people that still think spring == DI! http://t.co/vry95Fao0H #springio #springboot #springxd #hadoop #R…

Scooped by Jean-Baptiste Poullet

Cascading 2.5 Supports Hadoop 2

New version of Cascading released this week incorporates Hadoop 2 support and includes Cascading Lingual - an open source project that provides a comprehensive ANSI SQL interface for accessing Hadoop-based data...

Scooped by Jean-Baptiste Poullet

Hadoop + Ubuntu: The Big Fat Wedding | SmartData Collective

Last month, Canonical, the organization behind the Ubuntu operating system, partnered with MapR, one of the Hadoop heavyweights, in an effort to make Hadoop available as an integrated part of Ubuntu through its repositories.