Text Analytics
Follow
8.5K views | +2 today
 
Scooped by Stuart Shulman
onto Text Analytics
Scoop.it!

NYT: Big Data Troves Stay Forbidden

Huge repositories of data collected by Internet companies are not accessible to scientists, leading some to complain that studies based on these data can't be peer-reviewed.
more...
No comment yet.
Text Analytics
Archive, Search, Filter, Cluster, Human Code & Machine Classify Text
Curated by Stuart Shulman
Your new post is loading...
Your new post is loading...
Scooped by Stuart Shulman
Scoop.it!

Helping Jakarta track flooding in real time to save more lives | Twitter Blogs

Helping Jakarta track flooding in real time to save more lives | Twitter Blogs | Text Analytics | Scoop.it
PetaJakarta.org uses Twitter data and community participation to increase public safety during floods.
Stuart Shulman's insight:

Great use of a Twitter #datagrant

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

DRAFT: Code of Ethics & Standards for Social Data

DRAFT: Code of Ethics & Standards for Social Data | Text Analytics | Scoop.it
[PDF version: FINAL DRAFT Code of Ethics for Social Data] PREAMBLE Social media offers an unprecedented set of opportunities and responsibilities for individuals and organizations. For individuals,...
Stuart Shulman's insight:

This is still a work in progress. The @bbi Board and members would like to hear your thoughts on v2. 

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Building a complete Tweet index | Twitter Blogs

Building a complete Tweet index   | Twitter Blogs | Text Analytics | Scoop.it
Today, we are pleased to announce that Twitter now indexes every public Tweet since 2006. Since that first simple Tweet over eight years ago, hundreds of billions of Tweets have captured everyday h...
Stuart Shulman's insight:

A massive achievement. Hats off to @TwitterEng for this amazing upgrade to the advanced Twitter search.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Big Boulder Initiative Workshop

Big Boulder Initiative Workshop | Text Analytics | Scoop.it

The Big Boulder Initiative is excited to host a half-day workshop to advance work on the draft Code of Ethics (http://bit.ly/1oZ7Isr), which was first presented at the recent Big Boulder conference. Please join us from 12-5 PM in Boston on October 14th for a thought-provoking and collaborative session that will produce a final published code of ethics we can all stand behind. 

Stuart Shulman's insight:

A great opportunity to shape a code of ethics for the social data industry.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

CONTAGION—From Justin Bieber to data scientists, how Twitter got hot in the academy

CONTAGION—From Justin Bieber to data scientists, how Twitter got hot in the academy | Text Analytics | Scoop.it
Studying viral trends on the social media platform has gone viral itself: the seventh in a Fortune series on how things spread.
Stuart Shulman's insight:

"A couple years ago, Sherry Emery, a health economist at the University of Illinois at Chicago, found herself reading tweets about “smoking hot girls.” Also about “smoking ribs,” “smoking weed,” and the “smoking chimney” of the papal conclave."

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Open Data on Net Neutrality: Help Crowd Source Analysis of Comments to the FCC

Open Data on Net Neutrality: Help Crowd Source Analysis of Comments to the FCC | Text Analytics | Scoop.it

Yesterday the FCC released the public comments on Net Neutrality. The FCC has asked the public to help make “visualizations” to help surface substantive comments and key themes. 

Stuart Shulman's insight:

We invite you to join our collaborative, web-based effort to find substantive comments and visualize what the public said about Net Neutrality. You can work directly with me and others to crowd source the review of the non-duplicate comments, or you can conduct your own parallel project with the same data.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Hashtag MRX Influencers

A response to the question posed on Twitter about who the influential #mrx Tweeters are.
Stuart Shulman's insight:

Just a few of the options when working with #Twitter data in DiscoverText.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Big Boulder 2014 Highlights

This is "Big Boulder 2014 Highlights" by Gnip on Vimeo, the home for high quality videos and the people who love them.
Stuart Shulman's insight:

Excellent conference and launch of @bbi, a trade association for the social data industry.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Social Data & Tools: Prizes for Academics

Social Data & Tools: Prizes for Academics | Text Analytics | Scoop.it

"We felt inspired by the recent #DataGrants experiment sponsored by Twitter that generated more than 1,300 proposals from 60 countries and resulted in six extremely interesting awards. One thing is clear: many more grants of social data licenses are needed to fuel academic research." 

Stuart Shulman's insight:

Twelve winners every month this summer will get tools and premium#socialdata for free. Twitter, Tumblr, WordPress & Disqus data prizes and three grand prizes of #free historical Twitter data.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

2014 Big Boulder Conference

2014 Big Boulder Conference | Text Analytics | Scoop.it
An historic event bringing together top publishers, industry leaders and consumers of social media data for two days of thought leadership to discuss trends, insights, and developments in the emerging social data economy.
Stuart Shulman's insight:

A new social data industry association will launch in 24 hours at #bigboulder. Sign up to get involved shaping the future landscape of social data marketplace.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Briefing for SurveyMonkey

7.5 minute briefing for SurveyMonkey personnel on some of the interesting use cases when you bring large response sets into DiscoverText.
Stuart Shulman's insight:

Of all the cool stuff covered here, I think the automated clustering has the greatest potential to transform the way researchers review large numbers of open-ended survey responses.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Leveraging the Search API

Leveraging the Search API | Text Analytics | Scoop.it
Brands these days are savvy about comprehensively tracking keywords, competitors, hashtags, and so on. But there will always be unanticipated events or news stories that pop up. The keywords associated with these events are rarely ever tracked in advance. So … Continue reading →
Stuart Shulman's insight:
A great feature in the Gnip toolkit.
more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

SD Times: The Big Boulder Initiative

This initiative seeks to get business intelligence and data-gathering companies to adopt standards and regulations
Stuart Shulman's insight:

Getting close to the launch of the Big Boulder Initiative.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Historical Twitter Prize Drawing

Historical Twitter Prize Drawing | Text Analytics | Scoop.it

We need beta testers to close out our first year of new product development. 

Stuart Shulman's insight:

We will award five equal prizes in December. The winners will be able to get new estimates for up to 3 Twitter days and no more than 200,000 tweets. Texifter will pay the license fees and the winners will have access to the data in gratis DiscoverText Enterprise accounts for 90 days.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Twitter Search Index Update

A 2-minute introduction to the new Twitter search that goes back to the dawn of Twitter time.
Stuart Shulman's insight:

My take on why this is great.

more...
No comment yet.
Rescooped by Stuart Shulman from Data Nerd's Corner
Scoop.it!

Times Data Scientist Highlights Importance of User Data | News | The Harvard Crimson

Times Data Scientist Highlights Importance of User Data | News | The Harvard Crimson | Text Analytics | Scoop.it
Wiggins said that a good data scientist needs interpersonal skills because the ability to communicate ideas is crucial to getting any job done. Moreover, a data scientist should strive to build models that are “both predictive and interpretable” and constantly engage in critical thinking, he said.

Audience members said they found the lecture informative.

“It was interesting to learn how a print media company that has turned to an experiment in digital media wants to learn more about its users and how they can better serve them,” said Justin Ellis, a staff member at the Nieman Journalism Lab.

Via Carla Gentry CSPO
Stuart Shulman's insight:

I have seen Mr. Higgins speak and he is a great example of the cross-disciplinary data scientist who performs significant acts of translation.

more...
Carla Gentry CSPO's curator insight, November 3, 9:24 AM

the role of a data scientist is especially important in an age in which the journalism industry is adjusting to fast-paced changes to its business model, driven in part by a decline in advertising revenue for print media.

Scooped by Stuart Shulman
Scoop.it!

How You Can Prepare for Twitter’s Potential Upcoming Changes

How You Can Prepare for Twitter’s Potential Upcoming Changes | Text Analytics | Scoop.it

Twitter strategy might be shifting and the implications for our personal and business experience could be profound.

Source: www.businessesgrow.com

Stuart Shulman's insight:

Algorithms everywhere shaping everything.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Texifter Social Data and Tools: July Prize Winners

As a part of getting new users to test our sifter beta, every month this summer we are awarding 12 #datagrants to academics. All you need to do to be included in the August drawing is submit a valid historical Twitter estimate request using sifter and then send us your CV. These prizes shave thousands of dollars of costs off of your research.

Stuart Shulman's insight:

Prizes are worth $9,000 - $16,000. One last drawing at the end of August. A five minute beta test could get you a million historical tweets and a year of free software.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Using Gnip PowerTrack Filters in Sifter

A 4-minute introduction to how Gnip PowerTrack filters work when generating free estimates from the complete history of Twitter.
Stuart Shulman's insight:

Better handling of historical Twitter rules.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Social Data & Tools June Prize Winners

Social Data & Tools June Prize Winners | Text Analytics | Scoop.it

As a part of getting new users to test our sifter beta, every month this summer we are awarding 12 #datagrants to academics. All you need to do to be included in the July drawing is submit a valid historical Twitter estimate request using sifter and then send us your CV. These prizes shave thousands of dollars of costs off of your research.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Data at Scale

Data at Scale | Text Analytics | Scoop.it
Dmitriy Ryaboy from Twitter discusses the challenges of scale. Twitter’s fail whale days may be a thing of the past but it’s still fun to reminisce. For Dmitriy Ryaboy, head of Twitter’s 40-person ...
Stuart Shulman's insight:

The inside scoop from Dmitriy @Twitter on how analytics has evolved through stages of rapid network and personnel growth.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

A Code of Ethics for Social Data: We Need Your Help!

A Code of Ethics for Social Data: We Need Your Help! | Text Analytics | Scoop.it

bbi One of the most important functions that the Big Boulder Initiative can provide is to help establish and clarify a code of ethics for the proper use of social data for industry, academia and other ...

Stuart Shulman's insight:

The first task for @bbi is to finalize a draft code of ethics for the fledgling social data industry. 

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

DiscoverText and SurveyMonkey Partnership | SurveyMonkey Blog

Now introducing our awesome partnership with the web-based collaborative text analysis platform, DiscoverText. Learn more!
Stuart Shulman's insight:

The technical integration and promotion of this partnership went off without a hitch. We are delighted to be the #bigdata solution suggested by the world's largest survey web site.

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

SurveyMonkey Integration with DiscoverText

Announcing a new strategic partnership and technical integration as part of the SurveyMonkey partner program. DiscoverText users can now import survey results directly from SurveyMonkey. This three-minute video shows step-by-step how to credential in to access your survey open ends as well as the structured metadata for advanced text analytics.
Stuart Shulman's insight:

We are very excited about the announcement tomorrow. The folks at SurveyMonkey have been great to work with. This is a big step for our little start-up. 

more...
No comment yet.
Scooped by Stuart Shulman
Scoop.it!

Historical Twitter Data

Historical Twitter Data | Text Analytics | Scoop.it

The application sifter provides search and retrieve access to every undeleted Tweet in the history of Twitter. Powered by Twitter-certified social data provider Gnip.com, users can submit "Gnip Historical PowerTrack" estimate requests using a variety of query rules. When the query is done, sifter generates an email estimating the approximate number of tweets responsive to the query and the cost to execute the retrieval. Once the Twitter data is licensed, we store the Twitter data in a free trial Enterprise DiscoverText account where you can perform advanced text analytics for 30 days - search, filter, cluster, code, and machine classify the data.

Stuart Shulman's insight:

We are completing work for a June launch of new filter tools that make it even easier to pull very precise samples from the complete (undeleted) history of Twitter. Bio, geo, language, influence, and more operators will be wired to drop down menus. 

more...
No comment yet.