Public Datasets -...
Follow
Find tag "tools"
7.9K views | +2 today
Public Datasets - Open Data -
Your new post is loading...
Your new post is loading...
Scooped by luiy
Scoop.it!

#DataMining: Practical Machine Learning #Tools and Techniques | #Weka #datascience #openaccess

#DataMining: Practical Machine Learning #Tools and Techniques | #Weka #datascience #openaccess | Public Datasets - Open Data - | Scoop.it
luiy's insight:

Teaching material

 

Slides for Chapters 1-5 of the 3rd edition can be found here.

Slides for Chapters 6-8 of the 3rd edition can be found here

 

These archives contain .pdf files as well as .odp files in Open Document Format that were generated using OpenOffice 2.0. Note that there are several free office programs now that can read .odp files. There is also a plug-in for Word made by Sun for reading this format. Corresponding information is on this Wikipedia page.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Seven Ways to Create a Storymap | #opendata #maps #ddj

Seven Ways to Create a Storymap | #opendata #maps #ddj | Public Datasets - Open Data - | Scoop.it
Evidence is Power
luiy's insight:

The above examples describe a wide range of geographical and geotemporal storytelling models, often based around quite simple data files containing information about individual events. Many of the tools make a strong use of image files as pat of the display. it may be interesting to complete a more detailed review that describes the exact data models used by each of the techniques, with a view to identifying a generic data model that could be used by each of the different models, or transformed into the distinct data representations supported by each of the separate tools.


- See more at: http://schoolofdata.org/2014/08/25/seven-ways-to-create-a-storymap/#sthash.tWi68hgm.dpuf

more...
No comment yet.
Scooped by luiy
Scoop.it!

DiRT Directory : digital research #tools | #openaccess #dh

DiRT Directory : digital research #tools | #openaccess #dh | Public Datasets - Open Data - | Scoop.it
luiy's insight:

The DiRT Directory is a registry of digital research tools for scholarly use. DiRT makes it easy for digital humanists and others conducting digital research to find and compare resources ranging from content management systems to music OCR, statistical analysis packages to mindmapping software.

more...
QLET's curator insight, August 21, 11:32 AM

Great index of digital research tools for the curious contemporary researcher.

Scooped by luiy
Scoop.it!

The Open Database Of The #Corporate World | #opendata #economy

The Open Database Of The #Corporate World | #opendata #economy | Public Datasets - Open Data - | Scoop.it
Free and Open Company Data on millions of companies and corporations in over 20 countries, including UK, Spain, US, ...
luiy's insight:

What is OpenCorporates?

 

OpenCorporates aims to do a straightforward (though big) thing: have a URL for every company in the world.

 

Is that all?

Well, no useful though that would be, we're also gradually importing government data relating to companies, and trying to match it to specific companies

 

Why do this?

Few parts of the corporate world are limited to a single country, and so the world needs a way of bringing the information together in a single place, and more than that, a place that's accessible to anyone, not just those who subscribe to proprietary datasets. See also the OpenCorporates Principles

 

There are quite a few countries you're missing

We've grown from 3 territories and a few million companies to over 75 jurisdictions and 55 million companies, and are working with the open data community to add more each week.

 

How can we get hold of the data?

We have a new API service, as well as our highly popular Google Refine reconciliation service (seedocumentation), and this allows access to the information as JSON or XML. If you need data in bulk, either for academic research work, for another cool open data project, or commercially, drop us an email atinfo@opencorporates.com.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Open #Development Toolkit | #opendata #tools #dataviz

Open #Development Toolkit | #opendata #tools #dataviz | Public Datasets - Open Data - | Scoop.it
Open Development Toolkit
luiy's insight:

I want to visualise aid data

I want to understand aid flows into my country / from my country / from a certain donor /globallyI want to find aid data

 

What is ‘open development’?


Open Development sits at the intersection of the ‘open’ movement, and international development. This could take the form of looking at how open data can affect decisions made within international development; open access to research materials; or opening up the ways we work, for example by being more inclusive, to name just a few examples. If you want to find out more about open development, join the Open Development mailing list, where a wide range of topics within Open Development are discussed. It’s open to everyone to join!

 

more...
No comment yet.
Scooped by luiy
Scoop.it!

CitySDK: #toolkit for the development of digital services within cities | #opendata #smartcities

CitySDK: #toolkit for the development of digital services within cities | #opendata #smartcities | Public Datasets - Open Data - | Scoop.it
luiy's insight:

CitySDK is creating a toolkit for the development of digital services within cities. The toolkit comprises of open and interoperable digital service interfaces as well as processes, guidelines and usability standards. CitySDK enables a more efficient utilisation of the expertise and know-how of developer communities to be applied in city service development.

 

Apps and tools for CitySDK are developed in cooperation with the Code for Europe fellows (see www.codeforeurope.net)

more...
No comment yet.
Scooped by luiy
Scoop.it!

The Big Data #open source #tools | #bigdata

The Big Data #open source #tools | #bigdata | Public Datasets - Open Data - | Scoop.it
The Big Data open source tools landscape is growing rapidly. Check it out here.
luiy's insight:

There are already so many open source tools related to Big Data. Check out the below figure to find out about the most important open source tools for big data. In the near future we will describe each open source tool in more detail. At the moment you can click the logo’s to open the respective website

more...
No comment yet.
Scooped by luiy
Scoop.it!

#Privacy Tools: Opting Out from #DataBrokers | #datasets

#Privacy Tools: Opting Out from #DataBrokers | #datasets | Public Datasets - Open Data - | Scoop.it
Data brokers don't make it easy to see the data they hold about you. Here's what you can do to opt-out.
luiy's insight:

Data brokers have been around forever, selling mailing lists to companies that send junk mail. But in today’s data-saturated economy, data brokers know more information than ever about us, with sometimes disturbing results.


The first spreadsheet below is a list of data brokers who will give you copies of your data. (You can scroll around inside the box below, and you can also download your own copy of the spreadsheet, here.) The second is the list of data brokers from whom I sought to opt-out, with the ones that allowed opt-outs highlighted. (Download that onehere.)

more...
No comment yet.
Scooped by luiy
Scoop.it!

Immersion: a people-centric view of your email life | #SNA #tools #SNA_INDATCOM

Immersion: a people-centric view of your email life | #SNA #tools #SNA_INDATCOM | Public Datasets - Open Data - | Scoop.it
luiy's insight:
What is Immersion? It has been almost two decades since the beginning of the web. This means that the web is no longer just a technology of the present, but also, a record of our past. 

Email, one of the original forms of social media, is even older than the web and contains a detailed description of our personal and professional history. Immersion is an invitation to dive into the history of your email life in a platform that offers you the safety of knowing that you can always delete your data. Just like a cubist painting, Immersion presents users with a number of different perspectives of their email data. 
It provides a tool for self-reflection at a time where the zeitgeist is one of self-promotion. 
It provides an artistic representation that exists only in the presence of the visitor. 
It helps explore privacy by showing users data that they have already shared with others. 
Finally, it presents users wanting to be more strategic with their professional interactions, with a map to plan more effectively who they connect with. So Immersion is not about one thing. It’s about four. It’s about self-reflection, art, privacy and strategy. It’s about providing users with a number of different perspectives by leveraging on the fact that the web, and emails, are now an important part of our past.

 

 

 

Once you log in, Immersion will use only the From, To, Cc and Timestamp fields of the emails in the account you are signing in with. It will not access the subject or the body content of any of your emails.

 

Upon logging out of Immersion, you will be presented with a choice to save or delete your data, which contains your compressed email metadata and user profile. If you decide to save your email metadata with Immersion, that data will be stored in a secure system. You can always return to the site remotely and delete it at a later time, if you wish to do so.
If you take a snapshot of your Immersion network, the snapshot link will be accessible for 30 days, after which it will be deleted from our server.

more...
No comment yet.
Rescooped by luiy from Politique des algorithmes
Scoop.it!

Google has #open sourced a #tool for inferring cause from correlations | #algorithms #datascience

Google has #open sourced a #tool for inferring cause from correlations | #algorithms #datascience | Public Datasets - Open Data - | Scoop.it
Google open sourced a new package for the R statistical computing software that’s designed to help users infer whether a particular action really did cause subsequent activity. Google has been using the tool, called CausalImpact, to measure AdWords campaigns but it has broader appeal.

Via Dominique Cardon
luiy's insight:

Google announced on Tuesday a new open source tool that can help data analysts decide if changes to products or policies resulted in measurable change, or if the change would have happened anyway. The tool, called CausalImpact, is a package for the R statistical computing software, and Google details it in a blog post.

 

According to blog post author Kay H. Brodersen, Google uses the tool — created it, in fact — primarily for quantifying the effectiveness of AdWords campaigns. However, he noted, the same method could be used to gauge everything from whether adding a new feature caused an increase in app downloads to questions involving events in medical, social or political science.

 

http://google.github.io/CausalImpact/

 

 

more...
No comment yet.
Scooped by luiy
Scoop.it!

import.io + Open Refine + Google Fusion Tables = Magic! | #SNA #ddj

import.io + Open Refine + Google Fusion Tables = Magic! | #SNA #ddj | Public Datasets - Open Data - | Scoop.it
luiy's insight:

The University of Ottawa Library holds an employee training week every year, giving colleagues the opportunity to share experiences, skills, and insights with one another. I jumped on this opportunity to showcase import.io as a means of creating datasets from website content. The tutorial I developed demonstrated how to create a dataset from the City of Ottawa’s open data catalogue. It’s a really simple example to get users familiar with the functionality of import.io, an easy way to scrape web content via a simple interface and without having to code. In this post I will also demo how to use Open Refine to clean the data captured by import.io and how to visualize it using Google Fusion Tables.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Mapping Data: A guide for making #geodata visualizations | #ddj #methods #tools

Mapping Data: A guide for making #geodata visualizations | #ddj #methods #tools | Public Datasets - Open Data - | Scoop.it
luiy's insight:

As an add-on to our presentation we produced two more things, that some of you out there mind find helpful too:

 

- Mappable Toolset: The number of tools to process data, make maps, interactive visualizations etc. is continuously growing. While we love new tools, this leads to a situation that makes it quite hard to keep an overview of which tools are good for a certain tasks, where to find them and how much they cost. To keep track of the tools we've used so far and as a guide for others we thus collected our toolset. Have a look at it here:English version, German version.


-  Mappable Cheat-Sheet: Making maps and other visualizations with a geospatial component is certainly not a trivial tasks. There are many pitfalls, take alone spatial reference systems as an example, that might completely mess up your visualization if you don't handle them correctly. We thus created a checklist for making geodata visualizations in (data-driven) journalism. You can find it here: English version, German version.

more...
No comment yet.
Scooped by luiy
Scoop.it!

ckan - The #OpenSource data #portal software | #opendata #Opengob

ckan - The #OpenSource data #portal software | #opendata #Opengob | Public Datasets - Open Data - | Scoop.it
The open source data portal software
luiy's insight:

CKAN is open source, free software. This means that you can use it without any license fees, but more importantly, when you choose CKAN for your catalog you are also ensuring that you retain all rights to the data and metadata you enter, giving you freedom to move it elsewhere or manipulate it with your own tools without restriction.


There are lots of different open source licenses (you can find them at http://opensource.org) – CKAN is licensed specifically under the terms of the Affero GNU GPL v3.0. One of the strengths of the open source model is in the communities that form around free software products. The CKAN community is no different, and is arguably one of the strongest open data communities in the world. Together, the CKAN community has a wealth of knowledge and expertise that other people using the CKAN software can draw on. The Open Knowledge Foundation draw on and contribute to this rich resource to help us drive CKAN product development. - See more at: http://ckan.org/developers/about-ckan/#sthash.nY6V2GU7.dpuf

 

- See more at: http://ckan.org/#sthash.LMq0zg2B.dpuf

more...
No comment yet.
Scooped by luiy
Scoop.it!

Top 5 #OpenSource project #management #tools in 2014

Top 5 #OpenSource project #management #tools in 2014 | Public Datasets - Open Data - | Scoop.it
A list of top 5 open source project management tools for 2014.
luiy's insight:

Opensource.com covered some popular open source project management tools (ProjectLibre, ]project-open[, and OpenProject.) We found these articles to be valuable to our readers, so here we take a look forward at what we think 2014 holds for these open source project management tools.

 

This is by no means an exhaustive list, but each tool listed here has been deliberately selected based on a rich feature set:

 

- ProjectLibre

- LibrePlan

- OpenProject

- ]project-open[

- Redmine 

- Agilefant

 

more...
No comment yet.
Scooped by luiy
Scoop.it!

Case Studies on Collaboration, Crowdsourcing, Expert Networking and Local #Government Innovation | #opendata #tools

Case Studies on Collaboration, Crowdsourcing, Expert Networking and Local #Government Innovation | #opendata #tools | Public Datasets - Open Data - | Scoop.it
As part of an ongoing effort to build a knowledge base for the field of opening governance, the GovLab Wiki provides a collaborative repository of information and research at the nexus of technology, governance and citizenship. Every two weeks, The GovLab Blog will publish a snapshot of recent additions posted to the wiki. The following is a summary […]
luiy's insight:

Highlights

 

Our latest updates to the wiki focus on platforms for engaging the crowd, finding and engaging expertise and innovating at the local level – from improved hyperlocal city news to reducing strain on hospital emergency rooms.


• Tools for collaboration, like GitHub, have exploded in popularity over the course of the last four years – both in terms of active users and engagement on the platforms by those users.


• Crowdsourcing is increasingly being used a tool for addressing public problems and needs. SeeClickFix, for example, is helping local governments intelligently address problems affecting their citizens, and, perhaps surprisingly, these citizen-identified problems are being addressed in large numbers. Neighbor.ly, on the other hand, provides local governments with a platform for raising funding for public projects with demonstrated importance to citizens.


• Tools for identifying and engaging individuals with specific skills or interests – whether in terms of job experience, like LinkedIn, Futures.inc and oDesk; charitable interests, like Catchafire; or academic research focus, like ResearchGate – are becoming more prevalent, demonstrating the growing interest in such abilities across sectors.


• Tools for providing hyperlocal information, like EveryBlock, and personally relevant data, like Propellor Health can help improve citizens’ use of public services ranging from emergency rooms to community cultural centers.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Introducing Streamtools: A Graphical #Tool for Working with Streams of Data | #ddj #OpenNews

Introducing Streamtools: A Graphical #Tool for Working with Streams of Data | #ddj #OpenNews | Public Datasets - Open Data - | Scoop.it
Source - Journalism Code, Context & Community
luiy's insight:

INTRODUCING STREAMTOOLS: A GRAPHICAL TOOL FOR WORKING WITH STREAMS OF DATA

 

New and open source from the New York Times R&D Lab.

 

We see a moment coming when the collection of endless streams of data is commonplace. As this transition accelerates it is becoming increasingly apparent that our existing toolset for dealing with streams of data is lacking. Over the last 20 years we have invested heavily in tools that deal with tabulated data, from Excel, MySQL, and MATLAB to Hadoop, R, and Python+Numpy. These tools, when faced with a stream of never-ending data, fall short and diminish our creative potential.

 

In response to this shortfall we have created streamtools—a new, open source project by the New York Times R&D Lab which provides a general purpose, graphical tool for dealing with streams of data. It offers a vocabulary of operations that can be connected together to create live data processing systems without the need for programming or complicated infrastructure. These systems are assembled using a visual interface that affords both immediate understanding and live manipulation of the system.

  

more...
Mlik Sahib's curator insight, March 17, 11:09 PM

"In response to this shortfall we have created streamtools—a new, open source project by the New York Times R&D Lab which provides a general purpose, graphical tool for dealing with streams of data. It offers a vocabulary of operations that can be connected together to create live data processing systems without the need for programming or complicated infrastructure. These systems are assembled using a visual interface that affords both immediate understanding and live manipulation of the system."

 
Scooped by luiy
Scoop.it!

Media Tools – Google : Digital tools for #datajournalism I #opentools

Media Tools – Google : Digital tools for #datajournalism I #opentools | Public Datasets - Open Data - | Scoop.it
luiy's insight:

Welcome to Google Media Tools. Consider this your starting point to tap into Google’s suite of digital tools that can enhance newsgathering and exposure across television, radio, print and online.

 

Whether it’s refining your advanced search capabilities, improving audience engagement through Google+, or learning how to visualize data using Google Maps, this website is intended to guide you through all the resources Google offers to journalists.

more...
No comment yet.