Public Datasets - Open Data -
Scooped by luiy

Project #BigData. Expanding on Project C to look at a different use case | #datascience #opendata
luiy's insight:

Project Big Data is an interactive tool that lets you visualize and explore the funding patterns of over 600 companies in the Big Data ecosystem. It is based on the work I did for Project C (which you can see and read about here). The list of companies and their classification into categories is based on a dozen published sources and rough text analytics of the Crunchbase database. Crunchbase is a curated, crowdsourced database of over 285k companies.

 

As for the data, there are 645 public and private companies in the data set, from Teradata and IBM to Actuate and Zoomdata. I began by harvesting data from Crunchbase using their free API with Python. As of September, Crunchbase had 1,250 funding events for 410 of the companies on my list. I've grouped these companies into 18 categories, allowing you to compare peers as well as trends across categories. Some of the categories are broken down further. For example, the tool allows you to differentiate between cloud-based and on-premises solutions, or between SQL and NoSQL databases. I gathered additional data from a variety of sources. For example, LinkedIn was used to find the number of employees.
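To make the grouping step concrete, here is a minimal sketch of how funding events might be rolled up by category once they have been harvested. The record layout, company names and amounts are hypothetical; they are not taken from Crunchbase or from the workbook.

```python
from collections import defaultdict

# Hypothetical sample records; the real data set is not reproduced here.
funding_events = [
    {"company": "ExampleDB", "category": "NoSQL Databases", "amount_usd": 5_000_000},
    {"company": "ExampleDB", "category": "NoSQL Databases", "amount_usd": 12_000_000},
    {"company": "CloudViz", "category": "Visualization", "amount_usd": 3_500_000},
]


def summarize_by_category(events):
    """Return {category: {"events": n, "companies": k, "total_usd": t}}."""
    buckets = defaultdict(lambda: {"events": 0, "companies": set(), "total_usd": 0})
    for event in events:
        bucket = buckets[event["category"]]
        bucket["events"] += 1
        bucket["companies"].add(event["company"])
        bucket["total_usd"] += event["amount_usd"]
    # Replace each set of company names with its size so the result is easy to tabulate.
    return {
        category: {
            "events": b["events"],
            "companies": len(b["companies"]),
            "total_usd": b["total_usd"],
        }
        for category, b in buckets.items()
    }


summary = summarize_by_category(funding_events)
```

A per-category summary like this is the kind of table a visualization tool can then use to compare peers and trends across categories.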

 

 

OPENACCESS Workbook: Project Big Data v1.0 

https://public.tableausoftware.com/download/workbooks/ProjectBigDatav1_0?format=html

 

Scooped by luiy

All the Open #Datasets from New York City Visualized in a Single View | #opendata #dataviz
luiy's insight:

"Visualizing NYC's Open Data" [chriswhong.com], by self-proclaimed urbanist, map maker and data junkie Chris Whong, provides a single view of the more than 1,100 open datasets made available by New York City.

 

The visualization of the "dataset of datasets" consists of a force-directed graph whose nodes are colored according to whether the corresponding dataset is a table, a chart, a map, a file or a user-created view (colored blue).
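As a rough illustration of the data preparation behind such a graph, the sketch below turns catalog records into the nodes-and-links structure that force-directed layouts commonly consume. Only "user-created view = blue" comes from the description above; the other colors, the record fields and the idea of linking a view to its parent dataset are assumptions made for the example.

```python
# Hypothetical color assignment; only the blue for views is stated in the text.
TYPE_COLORS = {
    "table": "orange",
    "chart": "green",
    "map": "red",
    "file": "gray",
    "view": "blue",
}


def build_graph(datasets):
    """Build {"nodes": [...], "links": [...]} from catalog records.

    Each record needs an "id" and a "type"; an optional "parent" id is
    assumed to mean "this view was derived from that dataset".
    """
    nodes = [
        {"id": d["id"], "type": d["type"], "color": TYPE_COLORS[d["type"]]}
        for d in datasets
    ]
    links = [
        {"source": d["parent"], "target": d["id"]}
        for d in datasets
        if d.get("parent") is not None
    ]
    return {"nodes": nodes, "links": links}


# Hypothetical catalog entries.
catalog = [
    {"id": "trees", "type": "table"},
    {"id": "trees-by-borough", "type": "view", "parent": "trees"},
    {"id": "bike-lanes", "type": "map"},
]
graph = build_graph(catalog)
```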

 

The graph acts as an alternative portal to explore the available data, while demonstrating its scale and diversity.

 

Scooped by luiy

#OpenData Barometer Data | #opengov #dataviz
luiy's insight:

The Open Data Barometer takes a multidimensional look at the current adoption level of open data policy and practice around the world. Three main categories are considered as part of the barometer:

 

- Readiness - identifies how far a country has in place the political, social and economic foundations for realising the potential benefits of open data. The Barometer covers the readiness of government, entrepreneurs and business, and citizen and civil society.

 

- Implementation - identifies the extent to which government has published a range of key datasets to support innovation, accountability and improved social policy. The Barometer covers 14 datasets split across three clusters to capture datasets commonly used for: securing government accountability; improving social policy; and enabling innovation and economic activity.

 

- Emerging impacts - identifies the extent to which open data has been seen to lead to positive political, social, environmental and economic change. The Barometer looks for political impacts – including transparency & accountability, and improved government efficiency and effectiveness; economic impacts – through supporting start-up entrepreneurs and existing businesses; and social impacts – including environmental impacts, and contributing to greater inclusion for marginalised groups in society.

 

These factors are combined in a radar chart that represents each country's barometer.
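For readers unfamiliar with radar charts, the sketch below shows one common way to place scores on evenly spaced axes and obtain the polygon that gets drawn. The scores and the 0 to 100 scale are hypothetical and are not taken from the Barometer data.

```python
import math


def radar_vertices(scores, max_score=100.0):
    """Map each score to an (x, y) point on its own axis.

    Axes are spaced evenly around the origin, starting at 12 o'clock and
    running clockwise; a score equal to max_score lands on the unit circle.
    """
    n = len(scores)
    points = []
    for i, (name, value) in enumerate(scores.items()):
        angle = math.pi / 2 - 2 * math.pi * i / n
        radius = value / max_score
        x = round(radius * math.cos(angle), 3)
        y = round(radius * math.sin(angle), 3)
        points.append((name, x, y))
    return points


# Hypothetical scores for one country.
example = {"Readiness": 80.0, "Implementation": 50.0, "Emerging impacts": 20.0}
vertices = radar_vertices(example)
```

Connecting the returned points in order gives the country's shape on the chart.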

Scooped by luiy

Mapping Data: A guide for making #geodata visualizations | #ddj #methods #tools
luiy's insight:

As an add-on to our presentation we produced two more things that some of you out there might find helpful too:

 

- Mappable Toolset: The number of tools to process data, make maps, interactive visualizations etc. is continuously growing. While we love new tools, this makes it quite hard to keep an overview of which tools are good for certain tasks, where to find them and how much they cost. To keep track of the tools we've used so far, and as a guide for others, we collected our toolset. Have a look at it here: English version, German version.

- Mappable Cheat-Sheet: Making maps and other visualizations with a geospatial component is certainly not a trivial task. There are many pitfalls (spatial reference systems alone are a good example) that can completely mess up your visualization if you don't handle them correctly. We thus created a checklist for making geodata visualizations in (data-driven) journalism. You can find it here: English version, German version.
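To illustrate the spatial reference system pitfall mentioned in the cheat-sheet item, here is a small sketch converting longitude/latitude in degrees (WGS84) to Web Mercator metres (EPSG:3857), the system many web maps use. It uses the standard spherical Mercator formula; the sample coordinates for Berlin are approximate and chosen only for illustration.

```python
import math

# WGS84 semi-major axis in metres, used as the sphere radius by Web Mercator.
EARTH_RADIUS_M = 6378137.0


def lonlat_to_web_mercator(lon_deg, lat_deg):
    """Convert WGS84 degrees to Web Mercator (EPSG:3857) metres."""
    x = EARTH_RADIUS_M * math.radians(lon_deg)
    y = EARTH_RADIUS_M * math.log(math.tan(math.pi / 4 + math.radians(lat_deg) / 2))
    return x, y


# Berlin, roughly 13.4 degrees east and 52.5 degrees north.
x, y = lonlat_to_web_mercator(13.4, 52.5)
```

The two systems differ by several orders of magnitude, so plotting degree values on a map that expects metres places every point within a few dozen metres of the origin.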

Scooped by luiy

Open #Development Toolkit | #opendata #tools #dataviz
Open Development Toolkit
luiy's insight:

- I want to visualise aid data
- I want to understand aid flows into my country / from my country / from a certain donor / globally
- I want to find aid data

 

What is ‘open development’?


Open Development sits at the intersection of the ‘open’ movement, and international development. This could take the form of looking at how open data can affect decisions made within international development; open access to research materials; or opening up the ways we work, for example by being more inclusive, to name just a few examples. If you want to find out more about open development, join the Open Development mailing list, where a wide range of topics within Open Development are discussed. It’s open to everyone to join!

 

Scooped by luiy

The GDELT Project: realtime network diagram and database of global human society for open research | #opendata
The GDELT Project
luiy's insight:

The GDELT Project is a realtime network diagram and database of global human society for open research.

Watching The Entire World

GDELT monitors the world's news media from nearly every corner of every country in print, broadcast, and web formats, in over 100 languages, every moment of every day.

 


Global Reach

 

GDELT monitors print, broadcast, and web news media in over 100 languages from across every country in the world to keep continually updated on breaking developments anywhere on the planet. Its historical archives stretch back to January 1, 1979 and update daily (soon to be every 15 minutes). Through its ability to leverage the world's collective news media, GDELT moves beyond the focus of the Western media towards a far more global perspective on what's happening and how the world is feeling about it.

 

 

Querying, Analyzing and Downloading

 

The entire GDELT database is 100% free and open, and you can download the raw data files, visualize them using the GDELT Analysis Service, or analyze them at limitless scale with Google BigQuery.

 

Scooped by luiy

#Opendata Compass. What types of companies use which agencies' data? | #GovLab
The Open Data 500 is the first comprehensive study of U.S. companies that use open government data to generate new business and develop new products and services.
luiy's insight:

Study Goals

- Provide a basis for assessing the economic value of government open data
- Encourage the development of new open data companies
- Foster a dialogue between government and business on how government data can be made more useful


The GovLab’s Approach

The Open Data 500 study is conducted by the GovLab at New York University with funding from the John S. and James L. Knight Foundation. The GovLab works to improve people’s lives by changing how we govern, using technology-enabled solutions and a collaborative, networked approach. As part of its mission, the GovLab studies how institutions can publish the data they collect as open data so that businesses, organizations, and citizens can analyze and use this information.


Next Steps

The GovLab is now planning to use this study’s findings to convene a series of roundtables between government agencies and businesses that use their data to help improve the processes and priorities for data release. The Department of Commerce has committed to participate in the first discussion; other federal Departments have expressed an intent to participate in future roundtables.

In addition to our work in the U.S., we are now in discussions with representatives of several national governments and international organizations about the potential to replicate the Open Data 500 study in other countries.

 

Fàtima Galan's curator insight, May 26, 3:31 AM

"#OpenData is free, public data that can be used to launch commercial and nonprofit ventures, do research, make data-driven decisions, and solve complex problems."

Scooped by luiy

Serendipity is a faceted search engine based on #Semantic Web Technologies | #OpenCourse #dataviz
luiy's insight:

Serendipity is a faceted search engine based on Semantic Web Technologies. One important feature, Serendipity POIs (Points of Interest), allows users to visualize OCW repositories from a dataset based on Linked Data technologies.


Serendipity is sponsored by the research group GICAC from the Universidad Politécnica de Madrid (GICAC-UPM) and the Universidad Técnica Particular de Loja (UTPL) in collaboration with the OCW Institutions. This project aims to improve the searchability and discoverability of open educational content, which will enhance the ability for learners and educators to find and use OCW courses.

Scooped by luiy

New version of #opensource visualization Head Start released | #OpenScience #dataviz
luiy's insight:

Head Start is a web application built with D3.js. The first version worked very well in terms of user interaction, but it was a nightmare to extend and maintain. Luckily, Philipp Weißensteiner, a student at Graz University of Technology, became interested in the project. Philipp worked on the visualization as part of his bachelor’s thesis at the Know-Center. Not only did he modularize the source code, he also introduced Javascript Finite State Machine, which lets you easily describe the different states of the visualization. Setting up a new instance of Head Start is now only a matter of a couple of lines. Philipp developed a cool proof of concept for his approach: a visualization that shows the evolution of a research field over time using small multiples. You can find his excellent bachelor’s thesis in the repository (German). See more at: http://science.okfn.org/2014/02/24/new-version-of-open-source-visualization-head-start-released/
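The idea of describing a visualization's states with a finite state machine can be sketched in a few lines. The example below is written in Python for illustration only; it is not the Head Start code and does not use the Javascript Finite State Machine library, and the state and event names are hypothetical.

```python
class StateMachine:
    """Minimal finite state machine: a current state plus a table of
    allowed (state, event) -> next state transitions."""

    def __init__(self, initial, transitions):
        self.state = initial
        self.transitions = transitions

    def fire(self, event):
        """Apply an event, or raise if it is not allowed in the current state."""
        key = (self.state, event)
        if key not in self.transitions:
            raise ValueError(f"event {event!r} not allowed in state {self.state!r}")
        self.state = self.transitions[key]
        return self.state


# Hypothetical states and events for an interactive visualization.
viz = StateMachine(
    initial="overview",
    transitions={
        ("overview", "zoom_in"): "detail",
        ("detail", "zoom_out"): "overview",
        ("detail", "open_item"): "item",
        ("item", "close_item"): "detail",
    },
)
viz.fire("zoom_in")
viz.fire("open_item")
```

Keeping the allowed transitions in one table is what makes such code easier to extend: adding a new view means adding rows rather than touching every event handler.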

Rescooped by luiy from Data is big

60+ R resources to improve your data skills | #datascience #dataviz
From books to videos to online tutorials -- most free! -- here are plenty of ideas to burnish your R knowledge.

Via ukituki
luiy's insight:

These websites, videos, blogs, social media / communities, software and books/ebooks can help you do more with R; the favorites are listed in bold.

 

 

Rescooped by luiy from Social Network Analysis #sna

Making sense of #BigData : mining Twitter names | #dataviz #dh

Millions of geo tweets in various languages, discussing anything from 'hey, I'm here' to finance, geopolitics or marketing. How do you make sense of them? 


Via ukituki
luiy's insight:

Our name recognition software can predict, given a person's name, its cultural and linguistic classification, country of origin, gender and spelling variants.

 

Our onomastics blog presents a few examples of data visualizations, prepared using NamSor™ Onomastics software (NomTri™).

To know more about what we do, visit our website at http://namsor.com/ or email us at contact@namsor.com

ukituki's curator insight, December 21, 2013 8:27 AM

We’ve used name recognition (applied onomastics) to filter information and produce unique maps of the e-Diasporas. Where are the digitally connected Italians, Turks and Russians today? They may be migrants, tourists, business travellers, students, visiting scientists…

Scooped by luiy

New MIT Media Lab Tool Lets Anyone Visualize Unwieldy Government Data | #opengob #opendata #dataviz
DataViva, a project developed in part by Media Lab professor César Hidalgo, aims to make a wide swath of government economic data usable with a series of...
luiy's insight:

In the four years since the U.S. government created data.gov, the first national repository for open data, more than 400,000 datasets have become available online from 175 agencies like the USDA, the Department of Energy, and the EPA. Governments all over the world have taken steps to make their data more transparent and available to the public. But in practice, much of that data--accessible as spreadsheets through sites like data.gov--is incomprehensible to the average person, who might not know how to wrangle huge data sets. Never-ending tables mean next to nothing to me, even if I know that they might be hiding some interesting relationship within their numbers, like how income stacks up with happiness.

Scooped by luiy

Making data visualisations: a survival guide | #dataviz #dataJournalism

Data-Journalism Section or Website: Thomson Reuters for "Connected China". Reuters Connected China uses not only a formidable amount of statistics but also s...
Scooped by luiy

The First Interactive Network and Graph Data #Repository with Interactive Graph Analytics and Visualization | #opendata #SNA
The First Interactive Network Data Repository with Real-time Interactive Visualization and Analytics
luiy's insight:

Network Data Repository. Exploratory Analysis & Visualization.

 

A network and graph data repository containing hundreds of real-world networks and benchmark datasets. This large, comprehensive collection of network graph data is useful for research and as benchmark data for machine learning and network science. All data sets can be downloaded easily in a standard, consistent format. We have also built a multi-level interactive graph analytics engine that allows for visualizing the structure of networks as well as many global graph statistics and local node-level properties.
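To show what "global graph statistics" and "local node-level properties" can mean in practice, here is a minimal sketch that computes a few of them from a plain edge list. It assumes an undirected graph without self-loops; the toy network is hypothetical and is not one of the repository's datasets.

```python
from collections import defaultdict


def graph_statistics(edges):
    """Compute node count, edge count, density and per-node degree
    for an undirected graph given as (u, v) pairs."""
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    n = len(neighbors)
    # Every edge is counted once from each endpoint, hence the division by two.
    m = sum(len(adjacent) for adjacent in neighbors.values()) // 2
    degree = {node: len(adjacent) for node, adjacent in neighbors.items()}
    density = 2 * m / (n * (n - 1)) if n > 1 else 0.0
    return {"nodes": n, "edges": m, "density": density, "degree": degree}


# Hypothetical toy network: a triangle with one extra node attached.
stats = graph_statistics([("a", "b"), ("b", "c"), ("c", "a"), ("c", "d")])
```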

Rescooped by luiy from Geo-visualization

Visualizing Publicly Available US Government Data Online | #dataviz #opengov

Via Nicholas Goubert
luiy's insight:

Brightpoint Consulting recently released a small collection of interactive visualizations based on open, publicly available data from the US government. Characterized by a rather organic graphic design style and color palette, each visualization makes a socially and politically relevant dataset easily accessible.

 

Scooped by luiy

The #OpenData Research network | #opengov
luiy's insight:

The Open Data Research network.

 

Governments, civil society organisations and companies across the world are actively engaging with open data: publishing and using datasets to promote innovation, development and democratic change.

 

The Open Data Research network has been established to connect researchers from across the world working to explore the implementation and impact of open data initiatives. It is a joint project of IDRC and the Web Foundation, and is seeking to develop wider partnerships over the coming year.

 

The network currently hosts the 'Exploring the Emerging Impacts of Open Data in Developing Countries (ODDC)' programme.

Scooped by luiy

#OpenCorporates : How #complex are corporate structures? | #Opendata #dataviz
luiy's insight:

How complex are international corporate structures?

If you want to understand how complex multinational companies are, consider this:

 

In Hong Kong, there's a company called Goldman Sachs Structured Products (Asia) Limited. It's controlled by another company called Goldman Sachs (Asia) Finance, registered in Mauritius.

 

That's controlled by a company in Hong Kong, which is controlled by a company in New York, which is controlled by a company in Delaware, and that company is controlled by another company in Delaware called GS Holdings (Delaware) L.L.C. 

 

Which itself is a subsidiary of the only Goldman you're likely to have heard of, The Goldman Sachs Group in New York City.

That's only one of hundreds of such chains. All told, Goldman Sachs consists of more than 4000 separate corporate entities all over the world, some of which are around ten layers of control below the New York HQ.
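The chain described above can be written down as a simple child-to-parent mapping, which makes it easy to count how many layers of control sit between a subsidiary and the top. This is a sketch for illustration: the named entities come from the text, while the entries marked "(unnamed ...)" are placeholders for companies the text does not name.

```python
# Child -> controlling parent. Placeholders stand in for unnamed companies.
CONTROLLED_BY = {
    "Goldman Sachs Structured Products (Asia) Limited": "Goldman Sachs (Asia) Finance",
    "Goldman Sachs (Asia) Finance": "(unnamed Hong Kong company)",
    "(unnamed Hong Kong company)": "(unnamed New York company)",
    "(unnamed New York company)": "(unnamed Delaware company)",
    "(unnamed Delaware company)": "GS Holdings (Delaware) L.L.C.",
    "GS Holdings (Delaware) L.L.C.": "The Goldman Sachs Group",
}


def control_chain(entity, controlled_by):
    """Follow parent links upward and return the chain from entity to the top."""
    chain = [entity]
    seen = {entity}
    while chain[-1] in controlled_by:
        parent = controlled_by[chain[-1]]
        if parent in seen:
            raise ValueError("ownership cycle detected")
        chain.append(parent)
        seen.add(parent)
    return chain


chain = control_chain("Goldman Sachs Structured Products (Asia) Limited", CONTROLLED_BY)
layers_below_top = len(chain) - 1
```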

 

Of those companies, approximately a third are registered in nations that might be described as tax havens. Indeed, in the world of Goldman Sachs, the Cayman Islands are bigger than South America, and Mauritius is bigger than Africa.

These are maps of the top five banking companies in the US, and are based on publicly available data from the Federal Reserve. Read more about our data on the link at the top left.

Scooped by luiy

CitySDK: #toolkit for the development of digital services within cities | #opendata #smartcities
luiy's insight:

CitySDK is creating a toolkit for the development of digital services within cities. The toolkit comprises open and interoperable digital service interfaces as well as processes, guidelines and usability standards. CitySDK enables the expertise and know-how of developer communities to be applied more efficiently in city service development.

 

Apps and tools for CitySDK are developed in cooperation with the Code for Europe fellows (see www.codeforeurope.net)

Scooped by luiy

World Bank #Dataviz | #opendata #ddj
luiy's insight:

Visualizing:

 

- Where is the Extreme Poverty?

 

- Smoking Trends Around the Globe?

 

- What's the Story with Food Loss and Waste?

 

- Average House Size by Country

 

- How easy is Business in Brazil?

 

-  Have Industrialized Countries Reduced Their CO2 Emissions Since 1990? 

 

Submit your best development dataviz by clicking on the pencil icon on the right. If you have questions or comments about this Tumblr, please email dataviz@worldbank.org.

Rescooped by luiy from The urban.NET

City #Dashboard Amsterdam | #opengob #opendata #smartcities
luiy's insight:

Translating data into accessible information

 

Citizens and (government) agencies create and collect a lot of data, which they are now opening up for reuse more and more. This dashboard makes use of the latest open data from a wide range of municipal services. By using the Linked Data API from the CitySDK project, this City Dashboard becomes easily transferable to other cities using the same interface. The CitySDK Linked Data API also makes information searchable and available on demand, enabling developers to create applications such as this dashboard.

luiy's curator insight, May 24, 7:27 AM

The city in graphics

 

What is the average speed of traffic in the city? How many cultural events are going on? What are the levels of noise and what are people tweeting about? Answers to these and many other questions are featured and visualized in this dashboard of Amsterdam. Play and experiment with it, make different combinations and get to know the city real-time! 

 

 

 

How to use the City Dashboard

 

A city consists of many elements, shown on this dashboard as the following domains: transport, environment, statistics, economy, social, cultural & security. For each domain, the actual status is shown, based on blocks of 24 hours. The data is refreshed every 10 seconds. The information is captured in charts, graphs and on a map of the city. Larger dots and darker colors symbolize higher values and vice versa. On the map, you can choose which layers you want to see: from only one domain to interesting combinations and all of them together. Personalize what you are seeing and try to discover the city of Amsterdam on a whole new level! 

Mlik Sahib's curator insight, June 12, 12:53 AM
"General Information

 

Welcome to the City Dashboard! This platform shows you what is going on in Amsterdam at this moment.

No data available?

We aim to display all the numbers, graphics and other results in this City Dashboard as readably as possible. Since this platform collects data from different kinds of sources that are otherwise stored in multiple databases, it may happen that particular data isn't available for a moment. In that case the data source is interrupted."

Scooped by luiy

Applications and datasets from the open government data | #OpenData #OpenGob #dataviz
luiy's insight:

As part of the “Open Government Data Switzerland” project, the Swiss Federal Archives and their project partners are operating a central pilot portal providing access to open data from the Swiss authorities (“open government data” or OGD). The pilot portal was launched on 16 September 2013 and is expected to remain online until the end of 2014. The authorities involved in the project are supplying some of their already accessible data for use in the pilot portal. These include a wide variety of data records, such as Swiss municipal boundaries, population statistics, up-to-date camera images of weather in Switzerland, historical documents and a directory of Swiss literature. 

Rescooped by luiy from Global Brain

The brain’s #visual data-compression #algorithm
Data compression in the brain: When the primary visual cortex processes sequences of complete images and images with missing elements — here vertical contours — it “subtracts” the images from each other (the brain computes the differences between...

Via Spaceweaver
luiy's insight:

Researchers have assumed that visual information in the brain was transmitted almost in its entirety from its entry point, the primary visual cortex (V1).

 

“We intuitively assume that our visual system generates a continuous stream of images, just like a video camera,” said Dr. Dirk Jancke from the Institute for Neural Computation at Ruhr University.

 

“However, we have now demonstrated that the visual cortex suppresses redundant information and saves energy by frequently forwarding image differences,” similar to methods used for video data compression in communication technology. The study was published in Cerebral Cortex (open access).

 

Using recordings in cat visual cortex, Jancke and associates recorded the neurons’ responses to natural image sequences such as vegetation, landscapes, and buildings. They created two versions of the images: a complete one, and one in which they had systematically removed vertical or horizontal contours.

 

If these individual images were presented at 33Hz (30 milliseconds per image), the neurons represented complete image information. But at 10Hz (100 milliseconds), the neurons represented only those elements that were new or missing, that is, image differences.
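The video-compression analogy can be made concrete with a small sketch of frame differencing: instead of storing each full frame, store only the pixels that changed since the previous one. This illustrates the compression idea the researchers compare their finding to; it is not a model of the neural recordings, and the tiny frames are hypothetical.

```python
def frame_difference(previous, current):
    """Return only the pixels that changed, as {(row, col): new_value}."""
    return {
        (r, c): value
        for r, row in enumerate(current)
        for c, value in enumerate(row)
        if previous[r][c] != value
    }


def apply_difference(previous, diff):
    """Rebuild the current frame from the previous frame plus the stored changes."""
    rebuilt = [list(row) for row in previous]
    for (r, c), value in diff.items():
        rebuilt[r][c] = value
    return rebuilt


# Hypothetical 3x3 grayscale frames: the second frame has lost most of a
# vertical contour that runs down the middle column of the first.
frame_a = [[0, 9, 0], [0, 9, 0], [0, 9, 0]]
frame_b = [[0, 0, 0], [0, 0, 0], [0, 9, 0]]
diff = frame_difference(frame_a, frame_b)
restored = apply_difference(frame_a, diff)
```

Here the difference holds two values instead of nine, which is the saving that makes difference coding attractive when consecutive images are mostly alike.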

 

To monitor the dynamics of neuronal activities in the brain in the millisecond range, the scientists used voltage-dependent dyes. Those substances fluoresce when neurons receive electrical impulses and become active, measured across a surface of several square millimeters. The result is a temporally and spatially precise record of transmission processes within the neuronal network.

Rescooped by luiy from SNA - Social Network Analysis ... and more.

#DataScience Workflow: Overview and Challenges | #methods #research
I provide an overview of the data science workflow and highlight some challenges that data scientists face in their work.

Via João Greno Brogueira
luiy's insight:

@luiy. Great article about #DataScience: workflow design, methods and challenges.

 

What do data scientists do at work, and what challenges do they face?

 

This post provides an overview of the modern data science workflow, adapted from Chapter 2 of my Ph.D. dissertation, Software Tools to Facilitate Research Programming.

The Data Science Workflow

The figure below shows the steps involved in a typical data science workflow. There are four main phases, shown in the dotted-line boxes: preparation of the data, alternating between running the analysis and reflection to interpret the outputs, and finally dissemination of results in the form of written reports and/or executable code.

Scooped by luiy

Media Tools – Google : Digital tools for #datajournalism | #opentools
luiy's insight:

Welcome to Google Media Tools. Consider this your starting point to tap into Google’s suite of digital tools that can enhance newsgathering and exposure across television, radio, print and online.

 

Whether it’s refining your advanced search capabilities, improving audience engagement through Google+, or learning how to visualize data using Google Maps, this website is intended to guide you through all the resources Google offers to journalists.
