Samza is a stream processing system for running continuous computation on infinite streams of data.
Proposal
Samza provides a system for processing stream data from publish-subscribe systems such as Apache Kafka. The developer writes a stream processing task, and executes it as a Samza job. Samza then routes messages between stream processing tasks and the publish-subscribe systems that the messages are addressed to.
Your new post is loading...
Probably yet another hadoop component which will require a deployment of many processes