Data and Distributed Architectures
152 views | +0 today
Follow
Data and Distributed Architectures
Data and Distributed Architectures
Curated by Mathieu D
Your new post is loading...
Your new post is loading...
Scooped by Mathieu D
Scoop.it!

Databricks TechTalk: Spark Dataframes for Large-Scale Data Science - YouTube

Data frames in R and Python have become the de facto standards for data science. However, when it comes to Big Data, neither R nor Python data frames integra...
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

2013 July 23 Toronto Hadoop User Group Hive Tuning

Hive Deep Dive, Hive 0.11 Tuning tips, Hive 0.11 performance optimizations, and Tez
Mathieu D's insight:

a bit old (hive 0.11) but a really detailed one

more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Elasticsearch from the bottom up - YouTube

This talk will teach you about Elasticsearch and Lucene's architecture. The key data structure in search is the powerful inverted index, which is actually si...
Mathieu D's insight:

This video is a good start to understand what Elastic Search brings on top of Lucene.

more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Integrating with Hadoop: Import & Export Data with the Couchbase Sqoop Plugin – Couchbase

Integrating with Hadoop: Import & Export Data with the Couchbase Sqoop Plugin – Couchbase | Data and Distributed Architectures | Scoop.it
Learn how to design and implement big data solutions with document databases, Apache Hadoop, and more. We’ll being with a discussion on real-time big data architecture with an emphasis on real-time data access, distributed messaging, stream processing, and offline analysis. We’ll finish by explaining and demonstrating how to integrate Apache Hadoop with the Couchbase Server …
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Lightning fast analytics with Spark and Cassandra

Introduction to using Spark with Cassandra.
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Apache Cassandra Data Modeling with Travis Price

Learn how to model your data with Apache Cassandra by Travis Price
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

How Flash changes the design of database storage engines - O'Reilly Radar

How Flash changes the design of database storage engines - O'Reilly Radar | Data and Distributed Architectures | Scoop.it
Over the past decade, SSD drives (popularly known as Flash) have radically changed computing at both the consumer level — where USB sticks have effectively replaced CDs for...
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

dotScale 2014 - Laurence Moroney - Everything I learned about dynamic scaling of cloud apps. - YouTube

Filmed at http://dotscale.io in Paris on May 19, 2014. More talks at http://dotconferences.io Laurence is a developer advocate at Google and a science fictio...
Mathieu D's insight:

Nice talk about scaling up AND down fast. And some god recos about NoSQL KV design for scalability

more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Cassandra architecture and performance, mid 2014

Cassandra architecture and performance, mid 2014 | Data and Distributed Architectures | Scoop.it
DataStax – Software, support, and training for Apache Cassandra
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

DBMS Musings: Problems with CAP, and Yahoo’s little known NoSQL system

DBMS Musings: Problems with CAP, and Yahoo’s little known NoSQL system | Data and Distributed Architectures | Scoop.it

Interesting reflections about CAP theorem and its limits.

Proposes a new framework to discuss around Consistency/Avalability/Partition-tolerance/Latency...

more...
No comment yet.
Rescooped by Mathieu D from MongoDB Gems
Scoop.it!

Distributed Algorithms in NoSQL Databases

Distributed Algorithms in NoSQL Databases | Data and Distributed Architectures | Scoop.it

Scalability is one of the main drivers of the NoSQL movement. As such, it encompasses distributed system coordination, failover, resource management and many other capabilities.


Via Davis Tan
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Software Development & Architecture @ LinkedIn

Software Development & Architecture @ LinkedIn | Data and Distributed Architectures | Scoop.it
Sid Anand discusses the architectural and development practices adopted by LinkedIn as a continuous growing company.
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Lessons from Highly Scalable Architectures at Social Networking Sites

What are the techniques and technolgies used by popular social networking sites such as Facebook, Twitter, Tumblr, Pinterest or Instagram? How do they architec…
Mathieu D's insight:

Woot ! Heads up !

Excellent deck of slides on highly scalable architectures 

more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

A N1QL for Every Query: Extending SQL to a Document Database – Couchbase

A N1QL for Every Query: Extending SQL to a Document Database – Couchbase | Data and Distributed Architectures | Scoop.it
This session will provide an overview, update, and demo of N1QL, the upcoming query language from Couchbase. In addition to introducing N1QL and the current developer preview available at query.couchbase.com, we will discuss and show exciting new features as N1QL advances towards beta and production release.
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

The next generation storage engine for Couchbase Server and Couchbase Lite: ForestDB. Now available in Beta! | Couchbase Blog

The next generation storage engine for Couchbase Server and Couchbase Lite: ForestDB. Now available in Beta! | Couchbase Blog | Data and Distributed Architectures | Scoop.it
ForestDB project is an open source embeded key/value storage engine with great performance and space efficiency. The project started implementation a year ago. The main objective was to address the main drawbacks of typical B+-Tree index structure and push disk IO performance to the next level. 
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Realtime Trending Analysis with Approximate Algorithms

Realtime Trending Analysis with Approximate Algorithms | Data and Distributed Architectures | Scoop.it
When we hear about trending, twitter trending immediately comes to mind. However, there are many other scenarios, where such analysis is applicable. Some example  use cases  are 1. Top 5 videos wat...
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

NoSQL and ACID / Dave Rosenthal from FoundationDB

This presentation, given by Dave Rosenthal at NoSQL Now! 2013, presents the case for why he believes NoSQL databases will need to support ACID transactions in …
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Apache Kafka 0.8 basic training - Verisign

Apache Kafka 0.8 basic training (120 slides) covering: 1. Introducing Kafka: history, Kafka at LinkedIn, Kafka adoption in the industry, why Kafka 2. Kafka cor…
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

Cassandra Day 2014: Interactive Analytics with Cassandra and Spark

Take your analytics to the next level by using Apache Spark to accelerate complex interactive analytics using your Apache Cassandra data. Includes an introdu…
more...
No comment yet.
Scooped by Mathieu D
Scoop.it!

OCTO talks ! » Data Grid or NoSQL ? same, same but different…

OCTO talks ! » Data Grid or NoSQL ? same, same but different… | Data and Distributed Architectures | Scoop.it
OCTO Technology blog...
more...
No comment yet.