Scalding, Functional Programming, and Data Science
My bookmarks on stuff related to Scalding, functional programming, and data science.
Curated by Patrick Ting

Of Algebirds, Monoids, Monads, and other Bestiary for Large-Scale Data Analytics - Michael G. Noll

An introduction to Twitter's Algebird library, an abstract algebra for Scala.
Patrick Ting's insight:

Best explanation of monads I've read yet!

Introducing Pinterest Secor

Patrick Ting's insight:

Another interesting project related to the lambda architecture.

kijiproject/wiki

Kiji project-wide developer wiki
Patrick Ting's insight:

Cool project and cool wiki page on designing APIs

Akka at Conspire [Part 1]: How We Built Our Backend on Akka and Scala

How do three people build a scalable backend in a language only one has ever used before in just three months? By getting something else to do the heavy lifting.
(Spoilers: Paul learned Scala, you can...
Patrick Ting's insight:

Really good five-part blog series on Akka + Scala.

Scalding 0.9.0 and Beyond - YouTube

From matrices, to join algorithms, to columnar storage, this talk presented by Twitter's Oscar Boykin will introduce several new features in scalding 0.9.0 a...

Abstractivate: Scala: the global ExecutionContext makes your life easier

Apache Spark: The Next Big Data Thing?

Apache Spark is generating quite some buzz right now. Databricks, the company founded to support Spark, raised $14M from Andreessen Horowitz; Cloudera has decided to fully support Spark; and others chime in that it's the next big thing. So I thought it's high time I took a look to get an understanding of what the whole buzz is about.
Patrick Ting's insight:

Interesting reading on Spark and how it differs from Hadoop and Scalding.

karthik20522/SprayLearning

SprayLearning - Learning
Patrick Ting's insight:

Cool series of tutorials on Scala-related technologies.

Data Science Tools: Fast, easy to use, and scalable - Data

Here are a few observations based on conversations I had during the just-concluded Strata Santa Clara conference. Spark is attracting attention. I've written numerous times about components of...
Patrick Ting's insight:

Woohoo! Spark Streaming and everything awesome!

Data Science & bitly: Hilary Mason - YouTube

[The Vermont Complex Systems Center at the University of Vermont presented this talk on April 13, 2012, as part of its Complex Systems Spire Speaker Series.]...
Patrick Ting's insight:

Interesting visualizations of bitly data.

Scala: The Multi-Paradigm Language for the Multi-Core Age

Patrick Ting's insight:

Good comparison between Scala and Java.

Everything I Ever Learned About JVM Performance Tuning @Twitter

Summarizes about a year's worth of experiences and case studies in performance tuning the JVM for various services at Twitter.

TSAR, a TimeSeries AggregatoR | Twitter Blogs

Twitter is a global real-time communications platform that processes many billions of events every day. Aggregating these events in real time presents a massive challenge of scale. Classic time-ser...

The Neophyte's Guide to Scala Part 12: Type classes - Daniel Westheide

After having discussed several functional programming techniques for keeping things DRY and flexible in the last two weeks, in particular function …

SIP-21 - Spores - Scala Documentation

Patrick Ting's insight:

Interesting concept; I hope it makes it into the core of Scala!

Abstractivate: Choosing an ExecutorService

Patrick Ting's insight:

Really informative read on Java ExecutorServices

Design Patterns in Scala

Patrick Ting's insight:

Cool list of design patterns written in Scala!

Movie recommendations and more with Spark - Crouching Data, Hidden Markov

This post is inspired by Edwin Chen’s post on Scalding. I encourage you to first read that post! The Spark code is adapted from his Scalding code and …
Patrick Ting's insight:

Cool! Spark port of the original Scalding blog post

Parquet Hadoop Summit 2013

Parquet presentation given at Hadoop Summit 2013.

How to acquire the "Essential Skill Set"?- the Self Starter way.

The Essential Skill Set comprises the fundamental skills that form the foundation of every data scientist. Since my focus is on the self-learner's perspective, here I list open online resources that can be used to develop the 4 Essential Skills of a Data Scientist as a whole. Th...

Samza: Real-time Stream Processing at LinkedIn

Chris Riccomini discusses: Samza's feature set, how Samza integrates with YARN and Kafka, how it's used at LinkedIn, and what's next on the roadmap.

Scalding at Etsy

Patrick Ting's insight:

Interesting and funny slides.

Akka-based Data Extraction System Design - Alexander Chepurnoy

If you have experience with data extraction systems, you know how hard they can be to develop. You need to implement workers then …
Patrick Ting's insight:

Cool demonstration of the power of Akka!

johnmyleswhite/SimpleAintEasy

SimpleAintEasy - A compendium of the pitfalls and problems that arise when using standard statistical methods