e-Xploration
31.3K views | +2 today
Follow
e-Xploration
antropologiaNet, dataviz, collective intelligence, algorithms, social learning, social change, digital humanities
Curated by luiy
Your new post is loading...
Your new post is loading...
Scooped by luiy
Scoop.it!

The SHOGUN #MachineLearning #Toolbox | #datascience

The SHOGUN #MachineLearning #Toolbox | #datascience | e-Xploration | Scoop.it

The Shogun Machine learning toolbox provides a wide range of unified and efficientMachine Learning (ML) methods. The toolbox seamlessly allows to easily combine multiple data representations, algorithm classes, and general purpose tools. This enables both rapid prototyping of data pipelines and extensibility in terms of new algorithms. We combine modern software architecture in C++ with both efficient low-level computing backends and cutting edge algorithm implementations to solve large-scale Machine Learning problems (yet) on single machines.

 

One of Shogun's most exciting features is that you can use the toolbox through aunified interface from C++, Python, Octave, R, Java, Lua, C#, etc. This not just means that we are independent of trends in computing languages, but it also lets you use Shogun as a vehicle to expose your algorithm to multiple communities. We use SWIGto enable bidirectional communication between C++ and target languages. Shogun runs under Linux/Unix, MacOS, Windows.

more...
No comment yet.
Scooped by luiy
Scoop.it!

@WeAreTheDead : When #Twitter meets #Python! | #datascience

@WeAreTheDead : When #Twitter meets #Python! | #datascience | e-Xploration | Scoop.it

Reporters love Twitter and geeks love coding. Today, I’m merging the best of both worlds! On the menu: Python scripts to use Twitter to its full potential!

luiy's insight:

When my friend @TerraCiolfe showed me @WeAreTheDeads project, I said to myself that I really need to learn how to control Twitter through Python. @WeAreTheDeads is a Twitter account publishing the name of a fallen soldiers at the 11th minute of each hour.

 

Of course, nobody is working behind the screen. A program chooses the soldier in a database and publishes his name, hour after hour. With 119,000 names to publish, the script will run until 2023, according to the author of this great idea, the reporter @GlenMcGregor from the Ottawa Citizen.

 

With a little bit of research (my sources are at the end of the article), I learnt how to work with Twitter from a Python script. Actually, we can do way more than automatically publish tweets! It’s also possible to extract a lot of data about users and their tweets. For example, you can research specific tweets in a specific location. I created a nice animated map at the end. You’ll see!

more...
No comment yet.
Scooped by luiy
Scoop.it!

#Mining the Social Web, 2nd-Edition | #datascience #SNA #tools

#Mining the Social Web, 2nd-Edition | #datascience #SNA #tools | e-Xploration | Scoop.it
Mining-the-Social-Web-2nd-Edition - The official online compendium for Mining the Social Web, 2nd Edition (O'Reilly, 2013)
luiy's insight:

Chapter 0 - Preface

 

Chapter 1 - Mining Twitter: Exploring Trending Topics, Discovering What People Are Talking About, and More

 

Chapter 2 - Mining Facebook: Analyzing Fan Pages, Examining Friendships, and More

 

Chapter 3 - Mining LinkedIn: Faceting Job Titles, Clustering Colleagues, and More

 

Chapter 4 - Mining Google+: Computing Document Similarity, Extracting Collocations, and More

 

Chapter 5 - Mining Web Pages: Using Natural Language Processing to Understand Human Language, Summarize Blog Posts and More

 

Chapter 6 - Mining Mailboxes: Analyzing Who's Talking To Whom About What, How Often, and More

 

Chapter 7 - Mining GitHub: Inspecting Software Collaboration Habits, Building Interest Graphs, and More

 

Chapter 8 - Mining the Semantically Marked-Up Web: Extracting Microformats, Inferencing Over RDF, and More

 

Chapter 9 - Twitter Cookbook

 

Appendix A - Virtual Machine Experience

Appendix B - OAuth Primer

Appendix C - Python & IPython Notebook Tips

more...
No comment yet.
Scooped by luiy
Scoop.it!

Les #robots sur le Web social: état des lieux et #prospectives / via @LesDiplomates | #socialbots

Les #robots sur le Web social: état des lieux et #prospectives / via @LesDiplomates | #socialbots | e-Xploration | Scoop.it
Des robots avancés ont désormais infiltré les réseaux sociaux et peuvent interagir et tromper les êtres humains. Quelle impact sur les métiers du numérique?
luiy's insight:

De fait, les "bots" ne sont plus un épiphénomène, mais participent pleinement au fonctionnement d’Internet. Mais au-delà des classiquescrawlers bots, l’apparition de programmes automatiques plus ou moins raffinés sur le Web social pose indubitablement des questions d’ordre éthique, juridique et, surtout, stratégique.

 

 

Etat des lieux des robots à l’heure actuelle

 

A l’origine, les robots étaient des programmes informatiques censés effectuer des tâches répétitives, simples et automatisées, à un degré de fréquence plus ou moins élevé, avec le minimum d’implication.

 

Mais ces programmes ont gagné en raffinement à mesure qu’ils s’attaquaient aux réseaux sociaux. Désormais, des "socialbots" avancés ont infiltré Twitter et d’autres réseaux sociaux et sont en mesure de tromper les êtres humains. Si votre première pensée consiste à croire qu’un robot est facilement repérable et qu’il n’est pas très sophistiqué, vous êtes dans l’erreur. Un groupe de chercheurs brésiliens a récemment démontré que non seulement les robots étaient en mesure de pénétrer et stimuler des communautés, mais ils pouvaient également altérer leurs opinions et devenir des influenceurs.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Chorus Project : #Twitter #analytics tool suite | #bigdata

Chorus Project : #Twitter #analytics tool suite | #bigdata | e-Xploration | Scoop.it
Twitter data retrieval and visual analytics. Designed for social research. GUI based for easy access and fast productivity.
luiy's insight:

The Chorus package currently comprises of two distinct programs:

Tweetcatcher

Firstly, we have Chorus-TCD (TweetCatcher Desktop). Tweetcatcher allows users to sift Twitter for relevant data in two distinct ways: either by topical keywords appearing in Twitter conversation widely (i.e. semantically-driven data) or by identifying a network of Twitter users and following their daily ‘Twitter lives’ (i.e. user-driven data).

Tweetvis

Secondly, we have Chorus-TV (TweetVis), which is a visual analytic suite for facilitating both quantitative and qualitative approaches to social media data in social science. Visual analytics (VA) is an interdisciplinary computing methodology combining methods from data mining, information visualization, human-computer interaction and cognitive psychology. The VA approach is highly relevant to the aims of Chorus, enabling exploratory analysis of social media data in an intuitive and user-friendly fashion. Two main views are available within Chorus-TV. The Timeline Explorer (below) provides users an opportunity to analyse Twitter data across time and visualize the unfolding Twitter conversation according to various metrics (including tweet frequency, sentiment, semantic novelty and homogeneity, collocated words, and so on).

more...
No comment yet.
Scooped by luiy
Scoop.it!

La "boîte à outils" du cartographe de l’information et des réseaux | #SNA #gephi #tools

La "boîte à outils" du cartographe de l’information et des réseaux | #SNA #gephi #tools | e-Xploration | Scoop.it
luiy's insight:

La "boîte à outils" du cartographe de l’information et des réseaux s’est sérieusement étoffée depuis quelques mois. De quoi équiper un peu plus encore une activité qui connaît quelques succès aujourd’hui, et dont on commence à comprendre le rôle essentiel pour les organisations et les territoires (en rappelant, comme à chaque fois, que le travail du cartographe d’informations commence là où finissent les données et finit là où commence l’interprétation des phénomènes). La nouveauté, cette fois-ci, est qu’il s’agit de deux "plateformes" en ligne et non plus seulement d’un plug-in ou d’une application isolée. Et, dans les deux cas, elles viennent enrichir les contextes d’utilisation de GEPHI (pour la 5e année en 2013 au Google Summer of Code, le fameux Gsoc). La preuve, si besoin était, que Gephi n’est pas une "application" mais un écosystème d’innovation permanente constituée d’une multitude d’acteurs.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Chart.js | #HTML5 #Charts for your website | #dataviz #tools

Chart.js | #HTML5 #Charts for your website | #dataviz #tools | e-Xploration | Scoop.it
Open source HTML5 charts using the canvas tag. Chart.js is an easy way to include animated graphs on your website.
luiy's insight:
Easy, object oriented client side graphs for designers and developers
more...
No comment yet.
Scooped by luiy
Scoop.it!

Lincoln #Logarithms: Finding Meaning in Sermons | #dataviz #sna #DH

Lincoln #Logarithms: Finding Meaning in Sermons | #dataviz #sna #DH | e-Xploration | Scoop.it
luiy's insight:

The content and the tools


We explored the power and possibility of four digital tools—MALLET, Voyant, Paper Machines, andViewshare.  MALLET, Paper Machines, and Voyant all examine text.  They show how words are arranged in texts, their frequency, and their proximity. Voyant and Paper Machines also allow users to make visualizations of word patterns. Viewshare allows users to create timelines, maps, and charts of bodies of material. In this project, we wanted to experiment with understanding what these tools, which are in part created to reveal, could and could not show us in a small, but rich corpus.  What we have produced is an exploration of the possibilities and the constraints of these tools as applied to this collection.

more...
No comment yet.
Rescooped by luiy from Journalisme web et innovations
Scoop.it!

La Boîte à outils du journaliste web | #DDJ #tools #journalism

La Boîte à outils du journaliste web | #DDJ #tools #journalism | e-Xploration | Scoop.it
Voici une liste d'outils indispensables et gratuits pour tous les journalistes qui travaillent sur le web. Réseaux sociaux, édition multimédia en ligne, manipulation de code HTML... Le post sur le blog MediaType: http://nicolasbecquet.posterous.com/la-boite-a-outil-du-journaliste-web

Via Mathieu Gentile
more...
Mathieu Gentile's curator insight, May 31, 2014 4:42 AM

Pas si simple d'être journaliste sur le web, il faut être à l'écoute de ce qui se dit et s'écrit en permanence. Nicolas Becquet livre une panoplie d'outils tels Docteur Tweety ou Klynt....

Scooped by luiy
Scoop.it!

8 Web #Tools for Finding and Creating Data Visualizations | #dataviz #cartography

8 Web #Tools for Finding and Creating Data Visualizations | #dataviz #cartography | e-Xploration | Scoop.it
more...
No comment yet.
Scooped by luiy
Scoop.it!

An Interactive Introduction to Network Analysis and Representation | #SNA #tools

An Interactive Introduction to Network Analysis and Representation | #SNA #tools | e-Xploration | Scoop.it
luiy's insight:

This interactive application is designed to provide an overview of various network analysis principles used for analysis and representation. It also provides a few examples of untraditional networks used in digital humanities scholarship. Finally, along with the various methods described interactively here are links to related scholarship.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Datavisualization Selected #Tools | #dataviz #SNA

Datavisualization Selected #Tools | #dataviz #SNA | e-Xploration | Scoop.it
Datavisualization.ch Selected Tools is a collection of tools that we, the people behind Datavisualization.ch, work with on a daily basis and recommend warmly.
more...
No comment yet.
Rescooped by luiy from Homo Agilis (Collective Intelligence, Agility and Sustainability : The Future is already here)
Scoop.it!

Harnessing #CollectiveIntelligence: Wiki and Social Network From End-User Perspective #tools

Harnessing #CollectiveIntelligence: Wiki and Social Network From End-User Perspective #tools | e-Xploration | Scoop.it
In the social web in which ¿people socialize or interact with each other throughout the World Wide Web, social interactions lead to the creation of explicit and meaningfully rich knowledge representations¿. Emergence of social web shed light on the

Via Claude Emond
luiy's insight:

In the social web in which “people socialize or interact with each other throughout the World Wide Web, social interactions lead to the creation of explicit and meaningfully rich knowledge representations”. Emergence of social web shed light on the concept of collective intelligence (CI). Web 2.0 technologies as key part of social semantic web, play an important role to harness the CI. Web 2.0 technologies aredivided into the end-user and technical perspectives. In this paper CI and Web 2.0 is assessed with more details and througha theoretical framework regarding the end-user  perspective. From all various kinds of Web 2.0 technologies Wikis and Social networks are chosen due to their huge contribution to CI. This paper focuses on end-user perspective of Wiki and Social network; categorizes the end-user perspective of these two technologies into 4 core aspects; and on the basis of  findings from a web-based questionnaire, tests the relationship between each component of these 4 aspects and the CI

more...
Claude Emond's curator insight, February 2, 2014 9:49 PM

You can download this article right from the link (might require or not registering with FB or whatever) :)

Scooped by luiy
Scoop.it!

Abridged List of #MachineLearning Topics. #Resources #tools #datascience

Abridged List of #MachineLearning Topics. #Resources #tools #datascience | e-Xploration | Scoop.it
luiy's insight:

- Deep learning is a set of algorithms in machine learning that attempt to model high-level abstractions in data by using model architectures composed of multiple non-linear transformations.

 

- Online machine learning is a model of induction that learns one instance at a time thus reducing the amount of memory required.

 

- Natural Language Toolkit (NLTK) - a leading tool for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning.

 

-Computer Vision. OpenCV – popular computer vision library designed to by computational efficiency with a strong focus on real-time applications.

 

more...
No comment yet.
Scooped by luiy
Scoop.it!

Overview of #Python Visualization Tools | #dataviz #datascience

Overview of #Python Visualization Tools | #dataviz #datascience | e-Xploration | Scoop.it
Overview of common python visualization tools
luiy's insight:

Introduction

 

In the python world, there are multiple options for visualizing your data. Because of this variety, it can be really challenging to figure out which one to use when. This article contains a sample of some of the more popular ones and illustrates how to use them to create a simple bar chart. I will create examples of plotting data with:

 

- Pandas

- Seaborn

- ggplot

- Bokeh

- pygal

- Plotly

 

more...
No comment yet.
Scooped by luiy
Scoop.it!

Creating a #heatmap with Photoshop (for #NodeXL) | #SNA #gephi

Creating a #heatmap with Photoshop (for #NodeXL) | #SNA #gephi | e-Xploration | Scoop.it
luiy's insight:

The method presented here shows how to manually create a heatmap of any given NodeXL network with Photoshop. Since this is not an automated process, you can actually create a heatmap out of everything (if you have enough time and creativity). For example an “infrared-image” of a human person:

more...
No comment yet.
Scooped by luiy
Scoop.it!

Annotation Studio: suite of #collaborative web-based annotation #tools | #DH

Annotation Studio: suite of #collaborative web-based annotation #tools | #DH | e-Xploration | Scoop.it
luiy's insight:

Annotation Studio in the Digital Humanities

 

The most significant difference between Annotation Studio and other digital annotation projects is its emphasis on student-centered design and pedagogy. Most other annotation tools assume user familiarity with TEI, and a well-developed understanding of the relationships between literary sources, manuscripts, editions, and adaptations. Annotation Studio makes sophisticated yet easy-to-use commenting tools immediately accessible to students with no prior experience with close textual analysis or TEI.

 

However, while we believe Annotation Studio provides many unique affordances, we also see it as part of a larger conversation concerning annotation in the digital humanities. Accordingly, we have listed what we think are some of the most exciting projects occupying the annotation space, which bear both similarities and differences to the aims and formal qualities of our tool.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Morph : Get structured #data out of the web | #crawlers #datascience

luiy's insight:

Morph A Heroku for Scrapers

 

Get structured data out of the web

 

- All code and collaboration through GitHub

- Write your scrapers in Ruby, Python, PHP or Perl

- Simple API to grab dataSchedule scrapers or run manually

- Process isolation via Docker

- Trivial to move scraper code and data from ScraperWiki Classic

more...
No comment yet.
Scooped by luiy
Scoop.it!

Researchers Find and Decode the #SpyTools Governments Use to Hijack Phones | #privacy #surveillance

Researchers Find and Decode the #SpyTools Governments Use to Hijack Phones | #privacy #surveillance | e-Xploration | Scoop.it

Newly uncovered components of a digital surveillance tool used by more than 60 governments worldwide provide a rare glimpse at the extensive ways law enforcement and intelligence agencies use the tool to surreptitiously record and steal data from mobile phones.

luiy's insight:

The modules, made by the Italian company Hacking Team, were uncovered by researchers working independently of each other at Kaspersky Lab in Russia and the Citizen Lab at the University of Toronto’s Munk School of Global Affairs in Canada, who say the findings provide great insight into the trade craft behind Hacking Team’s tools.

 

The new components target Android, iOS, Windows Mobile, and BlackBerry users and are part of Hacking Team’s larger suite of tools used for targeting desktop computers and laptops. But the iOS and Android modules provide cops and spooks with a robust menu of features to give them complete dominion over targeted phones.

 

They allow, for example, for covert collection of emails, text messages, call history and address books, and they can be used to log keystrokes and obtain search history data. They can take screenshots, record audio from the phones to monitor calls or ambient conversations, hijack the phone’s camera to snap pictures or piggyback on the phone’s GPS system to monitor the user’s location. The Android version can also enable the phone’s Wi-Fi function to siphon data from the phone wirelessly instead of using the cell network to transmit it. The latter would incur data charges and raise the phone owner’s suspicion.

 

“Secretly activating the microphone and taking regular camera shots provides constant surveillance of the target—which is much more powerful than traditional cloak and dagger operations,” notes Kaspersky researcher Sergey Golovanov in a blog post about the findings.

more...
No comment yet.
Scooped by luiy
Scoop.it!

Visualizing Categorical Data as Flows with Alluvial Diagrams | #dataviz #methods #tools

Visualizing Categorical Data as Flows with Alluvial Diagrams | #dataviz #methods #tools | e-Xploration | Scoop.it
luiy's insight:

Alluvial diagrams are a type of flow diagram that  have traditionally been used to visually show changes in network structures over time. Density Design has included Alluvial Diagrams in their RAW online visualization tool and explored its use to show “relations between dimensions of categorical data.”

 

RAW is such a wonderfully easy tool to use that I wanted to explore the Alluvial diagram functionality with a few different data sets to see how the visualizations would come out.

 

more...
No comment yet.
Scooped by luiy
Scoop.it!

Klynt is an editing & publishing application dedicated to interactive #storytellers | #tools #ddj

Klynt is an editing & publishing application dedicated to interactive #storytellers | #tools #ddj | e-Xploration | Scoop.it
luiy's insight:

Klynt is an editing & publishing application dedicated to interactive storytellers. It was designed originally for Honkytonk Films in-house productions to create an affordable and easy-to-use solution to explore new narrative formats on the Internet.

 

Based on years of experience working with authors and interface designers on some of today’s most acclaimed interactive documentaries, the latest release of Klynt is now available for content producers active in the field of journalism, photography or documentary filmmaking.

 

Its growing user community is now composed of professionals working in NGOs (Greenpeace, WWF, Enfants du Mékong, UNICEF, The Red Cross), established media institutions (TV5 Monde, AFP (France), RTS (Switzerland), La Repubblica (Italy), RTBF (Belgium), Des Spiegel (Germany), Radio-Canada…) and students enrolled in 100+ prestigious journalism & visual communication training programs.

more...
No comment yet.
Scooped by luiy
Scoop.it!

The Miso Project. #OpenSource toolkit for #dataviz | #ddj #d3

The Miso Project. #OpenSource toolkit for #dataviz | #ddj #d3 | e-Xploration | Scoop.it
The Miso Project
luiy's insight:

Miso is an open source toolkit designed to expedite the creation of high-quality interactive storytelling and data visualisation content.

 

Miso consists of Dataset, a JavaScript client-side data management and transformation library, Storyboard, a state and flow-control management library & d3.chart, a framework for creating reusable charts with d3.js.

 

Miso is in active development, and will have components released as they are completed. Follow along: @TheMisoProject & Github

more...
No comment yet.
Rescooped by luiy from Big Data Analysis in the Clouds
Scoop.it!

Intro materials: network analysis software (UCINET, NodeXL, Gephi, Statnet, ERGM, RSiena) | #datascience #SNA

Intro materials: network analysis software (UCINET, NodeXL, Gephi, Statnet, ERGM, RSiena) | #datascience #SNA | e-Xploration | Scoop.it
Introductory materials, handouts and R scripts for network analysis and visualization.

 

In the fall of 2012, I got to design & lead the weekly labs for a network seminar at USC. I also worked on the methods portion of the syllabus for the class. COMM 645: Communication Networks is a PhD-level course taught by Peter Monge. The labs cover a range of network tools – from the classic UCINETprogram through NodeXL and Gephi, to R introduction, Statnet, exponential random graph and actor-based modeling. Since the handouts & script examples may be useful for people outside the course, I’m sharing them here.


Via Pierre Levy
more...
No comment yet.
Rescooped by luiy from intelligence collective
Scoop.it!

6th International #Conference on Computational #CollectiveIntelligence Technologies and Applications - ICCCI 2014

6th International #Conference on Computational #CollectiveIntelligence Technologies and Applications - ICCCI 2014 | e-Xploration | Scoop.it

Via Pierre Levy, thierrydenys
luiy's insight:

Important Dates

 

- Workshop / Special Session Proposals: Feb 20, 2014

 

- Notification of Workshop / Special Session acceptance: February 28, 2014

 

- Submission of papers (all): March 25, 2014

 

- Notification of acceptance: May 25, 2014

 

- Final papers to be received: June 10, 2014

 

- Conference: September 24-26, 2014

more...
No comment yet.
Rescooped by luiy from Humanities and their Algorithmic Revolution
Scoop.it!

Digital research #tools | #dh

Digital research #tools | #dh | e-Xploration | Scoop.it

Via Pierre Levy
luiy's insight:

This version of DiRT has been superseded by Bamboo DiRT, developed by Quinn Dombrowski and Project Bamboo.  Bamboo DiRT makes several improvements over the old DiRT and is much more current.  No new information will be added here as of 1/9/2012, but this wiki is still available for historical purposes. For more information, please see this message. 


Mew LINK:http://dirt.projectbamboo.org/

more...
Pierre Levy's curator insight, February 11, 2014 7:39 PM

Extensive trove of resources for digital humanities

sandra alvaro's curator insight, February 12, 2014 5:32 AM

wiki with resources for digital humanities projects