EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU
Curated by Shiwon Cho
Scooped by Shiwon Cho

HBase ZK-less Region Assignment : Apache HBase

Recently, we changed how HBase assigns regions. This architectural change is referred to as ZK-less region assignment, i.e. assigning regions without involving ZooKeeper. The change allows us to achieve greater scale as well as faster startups and assignment. The new design is simpler, has less code, and speeds up assignment so that rolling restarts run faster. The master has also been re-architected to handle more regions. This feature will be on by default in HBase 2.0.0.


Apache Mahout, Hadoop's original machine learning project, is moving on from MapReduce

The Apache Mahout project will now support Apache Spark and another data engine called H2O as it tries to retain its status as the go-to set of machine learning libraries for Hadoop.

The Apache Software Foundation Announces Apache™ Spark™ as a Top-Level Project : The Apache Software Foundation Blog

Apache CloudStack: Open Source Cloud Computing

Apache CloudStack is open source software designed to deploy and manage large networks of virtual machines, as a highly available, highly scalable Infrastructure as a Service (IaaS) cloud computing platform. CloudStack is used by a number of service providers to offer public cloud services, and by many companies to provide an on-premises (private) cloud offering, or as part of a hybrid cloud solution.

CloudStack is a turnkey solution that includes the entire "stack" of features most organizations want with an IaaS cloud: compute orchestration, Network-as-a-Service, user and account management, a full and open native API, resource accounting, and a first-class User Interface (UI).

Apache Camel web dashboard with hawtio | Architects Zone

hawtio is a lightweight and modular HTML5 web console for managing your Java stuff

How-to: Do Apache Flume Performance Tuning (Part 1)

Cloudera offers enterprises a powerful new data platform built on the popular Apache Hadoop open-source software package.

Apache Jena - Apache Jena

Apache Jena™ is a Java framework for building Semantic Web applications. Jena provides a collection of tools and Java libraries to help you develop Semantic Web and linked-data apps, tools, and servers.

2.1 Spotlight: Camel is Back

The Camel integration has been completely rewritten for Akka 2.1.
Camel is an open-source framework based on well-known Enterprise Integration Patterns, and it is well suited to integrating Akka systems with external systems. The Akka Camel module provides an easy-to-use bridge between actors and Camel endpoints.

Apache 2.x Modules In C++ (Part 2) - CodeProject

Part 2 - Stepping into the C++ world; Author: Andy Kirkham; Updated: 12 Nov 2012; Section: C / C++ Language; Chapter: Languages...

Apache CloudStack 4.0.0-incubating Released : Cloudstack

The Apache CloudStack project is pleased to announce the 4.0.0-incubating release of the CloudStack Infrastructure-as-a-Service (IaaS) cloud orchestration platform. This is the first release from within the Apache Incubator, the entry path into the Apache Software Foundation (ASF).

About Apache Flume FileChannel

Apache to Drill for big data in Hadoop - The H Open: News and Features

The Apache Incubator's newest arrival is Drill, which aims to become a platform for interactive, near-real-time analysis of big data, in a project inspired by Google's Dremel technology...

Connecting Redis to Solr for boosting documents :: Kelvin Tan - Lucene Solr crawl Consultant

There are a number of cases in Solr where it's desirable to retrieve data from an external datastore for boosting purposes instead of trying to contort Solr with multiple queries, joins, etc.

Announcing Hive 1.0: A Stable Moment in Time

The Apache Hive community releases Hive 1.0, an immense stride toward stability and reliability.

Apache Oozie Workflow Scheduler for Hadoop

Oozie is a workflow scheduler system to manage Apache Hadoop jobs.

Oozie Workflow jobs are Directed Acyclical Graphs (DAGs) of actions.

Oozie Coordinator jobs are recurrent Oozie Workflow jobs triggered by time (frequency) and data availability.

Oozie is integrated with the rest of the Hadoop stack supporting several types of Hadoop jobs out of the box (such as Java map-reduce, Streaming map-reduce, Pig, Hive, Sqoop and Distcp) as well as system specific jobs (such as Java programs and shell scripts).

Oozie is a scalable, reliable and extensible system.
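
As a rough illustration of the workflow model described above, the following Python sketch treats a workflow as a DAG of actions and computes one valid execution order. The action names and the `workflow` mapping are hypothetical, and the code relies on Python's standard `graphlib` module (Python 3.9+); it does not use Oozie's own workflow definition format or scheduler.

```python
from graphlib import TopologicalSorter, CycleError

# Hypothetical workflow: each action maps to the set of actions it depends on.
workflow = {
    "import_with_sqoop": set(),
    "clean_with_pig": {"import_with_sqoop"},
    "aggregate_with_hive": {"clean_with_pig"},
    "export_with_distcp": {"aggregate_with_hive"},
    "notify_by_shell": {"aggregate_with_hive"},
}


def execution_order(actions):
    """Return one valid run order; raise ValueError if the graph has a cycle."""
    try:
        # static_order() yields every action after all of its dependencies.
        return list(TopologicalSorter(actions).static_order())
    except CycleError as err:
        # args[1] holds the list of nodes that form the detected cycle.
        raise ValueError(f"not a DAG: {err.args[1]}") from err


order = execution_order(workflow)
print(order)
```

Because the graph is acyclic, every action appears after the actions it depends on. A dependency loop would be rejected with `ValueError`, which is why the acyclic property matters for a scheduler.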

Apache Spark for Big Analytics

By Thomas Dinsmore, Director of Product Management at Revolution Analytics. The emergence of Apache Spark is a key development for Big Analytics in 2013.

Apache OpenNLP - Welcome to Apache OpenNLP

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).
It supports the most common NLP tasks.

Apache UIMA - Apache UIMA

Unstructured Information Management applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. An example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at.

Lucene - Quickly add Index and Search Capability | Java Code Geeks

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.

Enterprise-ready Tool Support for Apache Camel | Javalobby

Apache Camel is my favorite integration framework on the Java platform due to great DSLs, a huge community, and so many different components. Camel is used...

Kiji Community - Build Real-Time Scalable Data Applications on Apache HBase

Apache 2.x Modules In C++ (Part 1) - CodeProject

Part 1 - Setting up and getting started; Author: Andy Kirkham; Updated: 10 Nov 2012; Section: C / C++ Language; Chapter: Languages...

Apache Lucene and Solr 4.0 Released | Architects Zone

After three years of development, we can finally say that the Apache Lucene library and Solr server 4.0 have been released.
Full list of changes in Apache Lucene...

Introducing Apache Apollo: Part I | Javalobby

Apache Apollo is the next-generation version of ActiveMQ built “from the ground up” on a core designed to be faster and scale better on multi-processor...

Apache ZooKeeper 3.3.6 has been released
