Public Datasets -...
Follow
Find tag "clustering"
7.8K views | +3 today
Public Datasets - Open Data -
Your new post is loading...
Your new post is loading...
Scooped by luiy
Scoop.it!

7+ ways to plot dendrograms in R I #Clustering #DataScience

7+ ways to plot dendrograms in R I #Clustering #DataScience | Public Datasets - Open Data - | Scoop.it
Today we are going to talk about the wide spectrum of functions and methods that we can use to visualize dendrograms in R. You can check an extended version of this post with the complete reproduci...
luiy's insight:

A quick reminder: a dendrogram (from Greek dendron=tree, and gramma=drawing) is nothing more than a tree diagram that practitioners use to depict the arrangement of the clusters produced by hierarchical clustering.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Cleaning Data with OpenRefine I #datacleaning #clustering #openTools

Cleaning Data with OpenRefine I #datacleaning #clustering #openTools | Public Datasets - Open Data - | Scoop.it
luiy's insight:

Don’t take your data at face value. That is the key message of this tutorial which focuses on how scholars can diagnose and act upon the accuracy of data. In this lesson, you will learn the principles and practice of data cleaning, as well as how OpenRefine can be used to perform four essential tasks that will help you to clean your data:

 

Remove duplicate recordsSeparate multiple values contained in the same fieldAnalyse the distribution of values throughout a data setGroup together different representations of the same reality 

These steps are illustrated with the help of a series of exercises based on a collection of metadata from the Powerhouse museum, demonstrating how (semi-)automated methods can help you correct the errors in your data.

 
more...
No comment yet.