Cloud & Big Data Platform
Cloud Software, Big Data, Technology, Software Stack
Scooped by Steve Hyounggi Min

What are HBase znodes?

GridGain - In-Memory Computing: HBase As Your Persistent Store?

Haeinsa Overview (HBase Transaction Library)

Haeinsa is a linearly scalable multi-row, multi-table transaction library for HBase. Haeinsa uses two-phase locking and optimistic concurrency control to implement transactions. The isolation leve...

HydraBase – The evolution of HBase@Facebook

When we revamped Messages in 2010 to integrate SMS, chat, email and Facebook Messages into one inbox, we built the product on open-source Apache HBase, a distributed key-value data store running on top of HDFS, and extended it to meet our requirements. At the time, HBase was chosen as the underlying durable data store because it provided the high write throughput and low-latency random read performance necessary for our Messages platform. It also provided other important features, including horizontal scalability, strong consistency, and high availability via automatic failover. Since then, we’ve expanded the HBase footprint across Facebook, using it not only for point-read, online transaction processing workloads like Messages, but also for online analytics processing workloads where large data scans are prevalent. Today, in addition to Messages, HBase is used in production by other Facebook services, including our internal monitoring system, the recently launched Nearby Friends feature, search indexing, streaming data analysis, and data scraping for our internal data warehouses.

Exploring Dynamic Loading of Custom Filters in HBase

Any program that pulls data from a large HBase table containing terabytes of data spread over many nodes needs to put some thought into how that data is retrieved. Failing to do so may mean waiting for, and then processing, a lot of unnecessary data, to the point where the program (whether a single-threaded client or a MapReduce job) becomes useless. HBase’s Scan API helps here: it configures the parameters of the data retrieval, including the columns to include, the start and stop rows, and batch sizing.
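As a rough illustration of what such scan parameters do, here is a toy in-memory model in Python. It is not the HBase client API: the `scan` function, its argument names, and the sample table are assumptions made for illustration, and `batch` here simply groups rows per yielded chunk, which simplifies HBase's own batch and caching settings.

```python
from bisect import bisect_left
from typing import Dict, Iterator, List, Optional, Set, Tuple

Row = Tuple[str, Dict[str, str]]  # (row key, {"family:qualifier": value})


def scan(rows: List[Row],
         start_row: str = "",
         stop_row: Optional[str] = None,
         columns: Optional[Set[str]] = None,
         batch: int = 2) -> Iterator[List[Row]]:
    """Toy range scan (hypothetical helper, not an HBase call).

    Yields chunks of at most `batch` rows whose keys satisfy
    start_row <= key < stop_row, keeping only the requested columns.
    `rows` must already be sorted by row key.
    """
    keys = [key for key, _ in rows]
    i = bisect_left(keys, start_row)  # jump straight to the start row
    chunk: List[Row] = []
    while i < len(rows) and (stop_row is None or rows[i][0] < stop_row):
        key, cells = rows[i]
        kept = {c: v for c, v in cells.items()
                if columns is None or c in columns}
        if kept:  # rows with no requested column are skipped entirely
            chunk.append((key, kept))
        if len(chunk) == batch:
            yield chunk
            chunk = []
        i += 1
    if chunk:
        yield chunk


if __name__ == "__main__":
    # Hypothetical sample data, sorted by row key.
    table = [
        ("row1", {"cf:a": "1", "cf:b": "2"}),
        ("row2", {"cf:a": "3"}),
        ("row3", {"cf:b": "4"}),
        ("row4", {"cf:a": "5"}),
    ]
    for part in scan(table, start_row="row2", stop_row="row4", columns={"cf:a"}):
        print(part)
```

Narrowing the key range and the column set before any data is returned is what keeps the client from receiving rows and cells it would only discard.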

Configuring Rack Awareness in Hadoop

Hadoop divides data into multiple file blocks and stores them on different machines. If Rack Awareness is not configured, there may be a possibility that Hadoop wi...
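For illustration, one common way to configure rack awareness is a user-supplied topology script, referenced by the `net.topology.script.file.name` property in core-site.xml. Hadoop passes host names or IP addresses as arguments and expects one rack path per argument on standard output. A minimal sketch of such a script in Python follows; the IP addresses and rack names in it are hypothetical.

```python
#!/usr/bin/env python3
import sys
from typing import List

# Hypothetical host-to-rack mapping; replace with the real cluster layout.
RACKS = {
    "10.0.1.11": "/dc1/rack1",
    "10.0.1.12": "/dc1/rack1",
    "10.0.2.21": "/dc1/rack2",
}
DEFAULT_RACK = "/default-rack"  # used for any host missing from the mapping


def resolve(hosts: List[str]) -> List[str]:
    """Return one rack path per host, in the same order as the input."""
    return [RACKS.get(host, DEFAULT_RACK) for host in hosts]


if __name__ == "__main__":
    # Hadoop may pass several hosts in a single invocation.
    print(" ".join(resolve(sys.argv[1:])))
```

The script must be executable by the user running the NameNode, and unknown hosts should still get a rack path so that lookups never return fewer results than arguments.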

HBase BlockCache 101 - Hortonworks

Understanding BlockCache in HBase.

[HADOOP-9771] Improve instructions for Eclipse with m2e plugin in http://wiki.apache.org/hadoop/EclipseEnvironment - ASF JIRA

G1GC vs. Concurrent Mark and Sweep Java Garbage Collector

Lately, inquiries about the G1GC Java Garbage Collector have been on the rise (e.g., see this search-lucene.com graph). So last Friday we took one of our Search Analytics servers running HBase Reg...

GridGain - In-Memory Computing: GridGain Goes Open Source Under Apache v2.0

HBase, Cassandra, and MongoDB - How They Recover From a Failure | Architects Zone

Operational stability and availability are a big deal when your application starts to handle the large, unstructured volumes of data that NoSQL solutions are...

Big data landscape version 2.0

Here's the second version of our big data landscape. Thoughts, questions, comments? We'd love to hear your feedback in the comments section here: http://wp.me/

Cassandra CLI Internals Using JArchitect

Cassandra CLI is a useful tool for Cassandra administrators. It is a good example of how to implement a Cassandra client, and studying its internals helps when developing custom Cassandra clients or extending the CLI tool itself.

gridgain « GridGain – In-Memory Computing

Posts about gridgain written by Nikita Ivanov

MySQL Slave Scaling (and more) | MariaDB

At Booking.com, we have very wide replication topologies. It is not uncommon to have more than fifty (and sometimes more than a hundred) slaves replicating from the same master. When reaching this number of slaves, one must be careful not to saturate the network interface of the master. A solution exists but it has its weaknesses. We came up with an alternative approach that better fits our needs: the Binlog Server. We think that the Binlog Server can also be used to simplify disaster recovery and to ease promoting a slave as a new master after failure. Read on for more details.

Raft Consensus Algorithm

Application Acceleration – Enterprise Flash Memory Platform

Delivering the world's data. Faster.

HBase Administration, Performance Tuning | Packt Publishing

Performance is one of the most interesting characteristics of an HBase cluster's behavior. It is a challenging operation for administrators, because performance tuning requires deep understanding of not only HBase but also of Hadoop, Java Virtual M

HadoopJavaVersions - Hadoop Wiki

Challenges of the Rapidly Growing LINE Infrastructure and How We Address Them « LINE Engineers' Blog

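Hello. In this post, the IT Service Center describes, from an infrastructure operations perspective, the challenges of the rapidly growing LINE infrastructure and how we are addressing them. Introduction: a large number of people attended the recently held LINE Developer Conference (infrastructure edition). The conference...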

Resources: World's 10 Big HBase Database Cluster Details

Creating a virtualized fully-distributed Hadoop cluster using Linux Containers

TL;DR: Why and how I created a working 9-node Hadoop cluster on my laptop. In this post I'll cover why I wanted to have a decent multi-node Hadoop cluster on my laptop, why I chose not to use virtual...

High Scalability - The WhatsApp Architecture Facebook Bought For $19 Billion

Rick Reed in an upcoming talk in March titled That's 'Billion' with a 'B': Scaling to the next ...

Performance and Fault Tolerance for the Netflix API

1) How Netflix does resilience engineering to tolerate failures and latency. 2) Changes in approach to API architecture to allow optimizing service endpoints to

Cloudera says Impala is faster than Hive, which isn't saying much

Cloudera is touting the speed of its Impala query engine compared to Hive and a leading relational database system, but those aren’t really apples-to-apples comparisons.