Entrepreneurship, Innovation
The 21st Century for Grade A Entrepreneurs
Scooped by Marylene Delbourg-Delphis onto Entrepreneurship, Innovation

Twitter is really smart to acquire Bluefin Labs - Going Social Now

Rescooped by Marylene Delbourg-Delphis from Content Curation World

Scrape Structured Data Easily from Any Web Page with TheWebMiner

Robin Good's curator insight, December 27, 2012 2:12 PM



TheWebMiner is an experimental web-based scraping tool capable of rapidly capturing any amount of structured data from any web page (for example, a Google search results page). Once captured, the data can be saved in SQL, CSV, XML, JSON and Excel file formats.
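
To make the export step concrete, here is a minimal Python sketch of saving captured rows as CSV and JSON, two of the formats listed above. It uses only the standard library; the records and the function names to_csv and to_json are hypothetical placeholders, not TheWebMiner's actual output or API.

```python
import csv
import io
import json

# Hypothetical records standing in for rows captured from a page.
records = [
    {"title": "Example result A", "url": "http://example.com/a"},
    {"title": "Example result B", "url": "http://example.com/b"},
]


def to_csv(rows):
    """Serialize a list of dicts to CSV text; the first row's keys form the header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_json(rows):
    """Serialize the same rows to an indented JSON array."""
    return json.dumps(rows, indent=2)


csv_text = to_csv(records)
json_text = to_json(records)
print(csv_text)
print(json_text)
```

The same list of dictionaries feeds both writers, so adding another output format only means adding another serializer.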

TheWebMiner is very easy to use: you only need to provide the URL of the reference page and text samples of the structured data units that you would like to capture.
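
The idea of capturing data from a text sample can be sketched in a few lines of Python. This is only an illustration of the general technique, not a description of how TheWebMiner works internally: it assumes that similar data units share the same tag and class attribute, and the names TextCollector and extract_like, as well as the sample page, are hypothetical.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not stay on the stack.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class TextCollector(HTMLParser):
    """Record each non-empty text node with the (tag, class) of its enclosing element."""

    def __init__(self):
        super().__init__()
        self.stack = []  # currently open elements as (tag, class) pairs
        self.nodes = []  # (signature, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append((tag, dict(attrs).get("class")))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy HTML.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text and self.stack:
            self.nodes.append((self.stack[-1], text))


def extract_like(html, sample):
    """Return every text whose enclosing element matches that of the sample text."""
    collector = TextCollector()
    collector.feed(html)
    collector.close()
    signature = next((sig for sig, text in collector.nodes if text == sample), None)
    if signature is None:
        return []
    return [text for sig, text in collector.nodes if sig == signature]


# Hypothetical page fragment used only for this sketch.
page = """
<ul>
  <li><span class="name">Blue mug</span> <span class="price">$8.00</span></li>
  <li><span class="name">Red mug</span> <span class="price">$9.50</span></li>
</ul>
"""

names = extract_like(page, "Blue mug")
prices = extract_like(page, "$8.00")
print(names, prices)
```

One sample per field is enough here because each field has its own class. Real pages are messier, which is presumably where a dedicated tool earns its keep.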


The service is free until January 5, 2013.


Pricing thereafter: http://thewebminer.com/priceing


Read more about it: http://extract-web-data.com/thewebminer-a-cloud-scraping-tool/



Try it out now here: http://thewebminer.com/demo (watch the video first)


More info: http://thewebminer.com/









Therese Torris's curator insight, December 28, 2012 9:55 AM

Structured data scraping, crawling and extraction is a major step up from the simpler text and image extraction now familiar from generalist content curation tools like Scoop.it. It's about recognizing and extracting specific types of structured data, such as product names, product prices, taxes and brand names.

 

Structured data extraction used to be the privilege of a few high-tech companies like Google.

 

It looks like it will soon be everybody's tool!