Big Data
241 views | +0 today
Follow
Your new post is loading...
Your new post is loading...
Scooped by Piyas De
Scoop.it!

Hadoop: What it is and why it matters

Hadoop: What it is and why it matters | Big Data | Scoop.it
Learn the basics of Hadoop and how data management, advanced analytics and data visualization can help you make sense of even the biggest data sets.
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Data-Stats

Map reduce program for implementing statistical analysis.
more...
No comment yet.
Rescooped by Piyas De from JavaScript for Line of Business Applications
Scoop.it!

Goodbye MongoDB, Hello PostgreSQL

Goodbye MongoDB, Hello PostgreSQL | Big Data | Scoop.it
Migrating from MongoDB to PostgreSQL

While we can be extremely proud of what we have achieved so far there was always something lurking in the dark: our primary database. From the start of Olery we’ve had a database setup that involved MySQL for crucial data (users, contracts, etc) and MongoDB for storing reviews and similar data (essentially the data we can easily retrieve in case of data loss). While this setup served us well initially we began experiencing various problems as we grew, in particular with MongoDB. Some of these problems were due to the way applications interacted with the database, some were due to the database itself.


Via Jan Hesse
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

New Version Of InfiniDB Provides Higher Query Performance With Faster Hadoop Integration - Tools Journal

New Version Of InfiniDB Provides Higher Query Performance With Faster Hadoop Integration - Tools Journal | Big Data | Scoop.it
New Version Of InfiniDB Provides Higher Query Performance With Faster Hadoop Integration (New Version Of InfiniDB Provides Higher Query Performance With Faster Hadoop Integration : http://t.co/ybBbx9OXam)...
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

MapR Adds Complete Apache Spark Stack to its Distribution for Hadoop

MapR Adds Complete Apache Spark Stack to its Distribution for Hadoop | Big Data | Scoop.it
Joins forces with Databricks to provide 24x7 support for Spark stack, drive roadmap and accelerate innovation on Spark and related projects MapR Technologies, Inc., provider of the top-ranked distribution for Apache Hadoop, today announced a...
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Get ready for a flood of new Hadoop apps

Get ready for a flood of new Hadoop apps | Big Data | Scoop.it
Hadoop's 2.0 release includes Yarn, a workload manager that could make it much easier to build and run apps on the open source big data platform (Get ready for a flood of new Hadoop apps http://t.co/V8E76hoi9G...
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

SQL is a Prerequisite Skill for Mainstream Hadoop | #fluentconf - SiliconANGLE (blog)

SQL is a Prerequisite Skill for Mainstream Hadoop | #fluentconf - SiliconANGLE (blog) | Big Data | Scoop.it
SQL is a Prerequisite Skill for Mainstream Hadoop | #fluentconf
SiliconANGLE (blog)
At the Fluent Conference this week, John Furrier, theCube host, invited Gary Nakamura, CEO of Concurrent Inc.
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Statistics vs Data Science vs BI(Just for Fun)

Statistics vs Data Science vs BI(Just for Fun) | Big Data | Scoop.it
As someone who trained as a statistician, I've always struggled with that title. I love the rigor and insight that Statistics brings to data analysis, but let's face it: Statistics — the name — has always had a bit of a branding problem.
more...
No comment yet.
Rescooped by Piyas De from Corporate Challenge of Big Data
Scoop.it!

Introducing: Project Open Data | The White House

Introducing: Project Open Data | The White House | Big Data | Scoop.it

Last week, President Obama launched the Administration's new Open Data Policy and Executive Order aimed at ensuring that data released by the government will be as accessible and useful as possible.

Project Open Data is an online, public repository intended to foster collaboration and promote the continual improvement of the Open Data Policy.


Via Ian Sykes
more...
No comment yet.
Suggested by Robyn
Scoop.it!

Big Data course by Gauthier Vasseur - Coursmos Blog

Big Data course by Gauthier Vasseur - Coursmos Blog | Big Data | Scoop.it
This course by Gauthier Vasseur will cover all the key elements to engage with confidence in your first steps with Big Data.
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Reliable, Low-Latency Request-Reply with ZeroMQ

Reliable, Low-Latency Request-Reply with ZeroMQ | Big Data | Scoop.it
Learn how AddThis implemented ZeroMQ for a low-latency request reply solution.
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Hadoop's rise: Why you don't need petabytes for a big data opening | ZDNet

Hadoop's rise: Why you don't need petabytes for a big data opening | ZDNet | Big Data | Scoop.it
People are often hung up on the volume aspect of big data but other factors can be just as telling in the issues they raise for business.
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Apache Hadoop 2.4.0 Released! - Hortonworks

Apache Hadoop 2.4.0 Released! - Hortonworks | Big Data | Scoop.it
Announcing Hadoop 2.4.0 - the second release of Hadoop in 2014. (Official release of Hadoop 2.4 by Apache means ACLs for security, automated failover for YARN resource server & more.
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Apache Tajo SQL-on-Hadoop engine now a top-level project - Gigaom

Apache Tajo, a relational database warehouse system for Hadoop, has graduated to to-level status within the Apache Software Foundation. It might be easy to overlook Tajo because its creators, committers and users are ...
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

What is Hadoop not good for?

What is Hadoop not good for? | Big Data | Scoop.it
Answer (1 of 7): Assuming you're talking about the MapReduce execution system and not HDFS/HBase/etc -- Easy things out of the way first: Real time anything You can use hadoop to do precalculations, but will need to do everything else using...
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Two forthcoming R books

Two forthcoming R books | Big Data | Scoop.it

The first is Applied Predictive Modeling by Max Kuhn and Kjell Johnson. Max Kuhn is the author of the caret package, an extremely useful and powerful R package for fitting and optimizing all kinds of predictive models in R. It's available now on Amazon Kindle and will be published in hardcover by Springer in July.

The second is Dynamic Documents with R and knitr by Yihui Xie, the author of the knitr package. With knitr you can easily create beautiful documents and reports, with text, tables and figures all dynamically generated by R. It will also be available in July.

more...
No comment yet.
Scooped by Piyas De
Scoop.it!

Defining Hadoop | Big Data | DATAVERSITY

Defining Hadoop | Big Data | DATAVERSITY | Big Data | Scoop.it
Brian Proffitt of ReadWrite recently explained what Hadoop is and how it works. (Defining Hadoop http://t.co/r9OlEJnQOS)
more...
No comment yet.
Scooped by Piyas De
Scoop.it!

The Next Big Thing in Big Data: People Analytics

The Next Big Thing in Big Data: People Analytics | Big Data | Scoop.it

By combining data from both real and virtual worlds, we can now understand behavior at a previously unimaginable scale.

When we use data to uncover the workplace behaviors that make people effective, happy, creative, experts, leaders, followers, early adopters, and so on, we are using “people analytics.”

more...
Jacek Bugajski's curator insight, May 18, 2013 5:31 AM

People Analytics - hmmm... Great idea for companies ;)