 Your new post is loading...
Quick reminder about five commonly held, yet inaccurate beliefs, related to Big Data. [Sponsored content] "It appears that Big Data is still a very big mystery to many, including some in the field. Here are the top five Big Data myths."
Good overview on how marketing departments can utilitize the deluge of data to their firm's advantage. Content is sponsored article by Adobe, on their excellent content marketing portal CMO.com "It’s no secret that brands are being deluged by what one expert refers to as “a tsunami of data.” But what to do with all of this newfound data and information presents many challenges and pitfalls." More: http://www.cmo.com/planning/how-rein-riches-big-data
This 3 minute video does a terrific job visualizing the impact big data is having on our online privacy. A must watch.
In the right hands and handled strategically, the massive amounts of information companies collect today can become a valuable new asset.
Microsoft has big plans for big data with Windows 8: In updating the road map for its Windows Embedded platform, Microsoft shows how it plans to tap into the Big Data opportunity created by a world of connected intelligent devices.
Johns Hopkins is taking a $1.2 million grant from the National Science Foundation to build a 100 gigabit per second network to shuttle data from the campus to other large computing centers at national labs and even at Google. The network will be capable of transferring an amount of data equivalent to 80 million file cabinets filled with text each day. More: http://gigaom.com/2011/11/08/for-science-big-data-is-the-microscope-of-the-21st-century/
Apart from industry hype it's easier to say what big data is not. To begin with it is not Hadoop (see preface to this series: What is Hadoop?). Nor is it simply having lots of data. And especially it is nothing to do with having lots of transactional data Let's think about data growth for a moment. The first thing to note is that petabyte scale storage issues are not new: CERN had a distributed Objectivity-based database holding a petabyte back in the 90s, long before the large Hadron collider was much more than a dream in the eye of most physicists. All that's happened is that the commercial world is catching up on the scientific community. And, of course, it's all relative: what's big to me may be chickenfeed to you. More: http://www.it-director.com/content.php?cid=13033
"First, the doubling time of medical knowledge is currently estimated to be less than ten years and continues to decrease. Second, the volume of medical literature that a busy doctor needs to read to stay current keeps increasing and all too frequently different research reports present apparently contradictory evidence – frequently, no single treatment approach is suitable for all cases. Finally, as in all other aspects of our life, the amount of useful interpretable health data being collected has started to grow at an explosive rate. When you consider these factors, combined with the pressure to maintain quality while seeing increasing volumes of patients, it’s easy to understand why many doctors today feel that there has to be a different and more satisfying way to deliver the high quality care that they believe is possible." More: http://blogs.sas.com/content/hls/2011/10/04/healthcare-has-a-problem-big-data-the-law-of-seven/
Kaggle is a platform where companies, researchers and governments can host competitions to help solve huge data-related problems. About 50 competitions have been run to date and members take just days to solve problems that have stumped scientists for years.
|
With the recent IPO of Splunk (currently valued at just over $3 Billion), a lot of attention has turned to Big Data. Problem is, it’s tough to keep track of all the companies involved in the space.
For those who can make sense of the explosion of data, there are job opportunities in fields as diverse as crime, retail and dating.
Bold prediction from Dan Vesset of IDC on the near and medium term future for big data.
By 2015, nearly 3 billion people will be online, pushing the data created and shared to nearly 8 zettabytes. Is your network ready for the deluge? Exploring this infographic is the first step toward building a tangible plan and it may be the difference between reacting and prospering in Big Data’s shadow.
Fascinating: Bitly's shortened links can forecast the next trend online—a partnership with Verisign is about to boost that predictive power.
E-commerce site Etsy has grown to 25 million unique visitors and 1.1 billion page views per month, and it's generating the data volumes to match. Using tools such as Hadoop and Splunk, Etsy is turning terabytes of data per day into a better product.
Accel Partners has set aside $100 million to invest in start-ups that are trying to harness the power of big data, the firm will announce at the Hadoop World conference in New York this morning, aiming to consolidate its position as one of the earliest investors in these companies as the amount of data generated by companies and government agencies continues to grow.
Hadoop benefits elusive for typical enterprises; Informatica hopes to change that: "Informatica takes an important baby step towards bringing data transformation inside Hadoop, a development that will lower the barriers for enterprises wishing to tap the power of MapReduce processing in working with unstructured or variably structured big data. Informatica may not be the first vendor to develop tooling for processing and it certainly won’t be the last, but the release of its new HParser tool is an important step in making Hadoop a more civilized platform for the enterprise." More: http://ovum.com/2011/11/03/informatica-makes-first-step-to-transforming-data-inside-hadoop/
We all know about Big Data as it refers to growing workloads of structured and unstructured data pouring into data centers. Well, networks handling Big Traffic are what brings all this data into servers, storage arrays and cloud services.
Sybase has announced the newest version of its analytics database, Sybase IQ. This latest release, 15.4, is aimed at helping clients handle the challenges of mining big data. The vendor has expanded support for in-database analytics and IQ now integrates with Apache Hadoop, an open-source framework used for data analysis. The tool also has predictive model markup language (PMML) support and an expanded library of statistical and data mining algorithms. Dan Lahl, senior direct of product marketing at Sybase, and Neil McGovern, senior director of marketing for the financial services industry at Sybase, spoke to BST about the trends in the big data space and what the industry can expect from future Sybase releases.
|