huge-data
huge data and cloud computing
Curated by Ludaohong

2012 Annual Review of China's IaaS Industry | Cloud Computing and Web Development – Hantangyue Blog (汉唐月博客)

Source: Hantangyue Blog. Author: Liu Liming. Notice: this article may not be reposted or commented on by the likes of "Dr. SaaS" (SaaS博士); reposting for any non-commercial purpose is welcome [...]

Salt Stack

Salt is an open source tool to manage your infrastructure. Easy enough to get running in minutes and fast enough to manage tens of thousands of servers (and still get a response back in seconds).


Baidu Developer Conference: Developer Center Launched, Cloud Technology Roadmap Released

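[CSDN.NET, by Fu Jiang] Today the 2012 Baidu Developer Conference, hosted by Baidu and co-organized by CSDN and other technology media under the theme "A World of Applications, a Future Created by the Cloud", was successfully held in Beijing. Attendees included Baidu executives, among them founder Li Yanhong (Robin Li), and many technical leaders from across the industry. The conference also announced a series of major initiatives, including the launch of the Baidu Developer Center, the cloud storage service Baidu Wangpan, the Baidu Cloud OS technical architecture, a cloud technology roadmap, and the consolidation of Baidu's open platform architecture...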

MapReduce 2.0 in Hadoop 0.23

In Building and Deploying MR2 we presented a brief introduction to MapReduce in Hadoop 0.23 and focused on the steps to set up a single-node cluster. This blog provides developers with architectural details of the new MapReduce design.

 

Apache Hadoop 0.23 has major improvements over previous releases. Here are a few highlights on the MapReduce front; note that there are also major HDFS improvements, which are out of scope of this post.


NoSQL Data Modeling Techniques

NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because sp...

A Summary of HBase Usage and Optimization at Taobao - NoSQLFan - Following NoSQL Technology and News

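HBase is a top-level Apache open source project that was split out of Hadoop. Because it implements most of the features of Google's Bigtable system well in Java, it has become very popular now that data volumes are surging. For Taobao, as the market has grown and its products and technology have evolved, business data volumes keep increasing, and efficient insertion and reading of massive data has become ever more important. Taobao runs what may be the largest single Hadoop cluster in China (Yunti, 云梯), so it knows the Hadoop family of products fairly well and naturally wanted to use HBase for this kind of massive-data read/write service. This article summarizes how Taobao has used and optimized HBase in online applications over the past year.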


Less Than Dot - Blog - Monitoring and Logging as a Service - Reviews

Ludaohong's insight:

In any case, let's see what we're going to be looking at:

Loggly - "It's fast, fun and easy to use"
DataDog - "On a mission to bring sanity to IT Management"
Splunk Storm - "Your data has the answers, we help you find them."
Sumo Logic - "Make Your Applications Run Longer & Stronger"
logentries - "We make your life easier"
papertrail - "Get back to work."

Precog: Big Data Analytics as a Service

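Recently, Precog announced its big data warehousing and analytics service, which handles data capture, transformation, analysis, and visualization, as well as the infrastructure the service runs on. The service also exposes open access points through a RESTful API, letting developers and data scientists control the whole process...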

How to crawl a quarter billion webpages in 40 hours | DDI

More precisely, I crawled 250,113,669 pages for just under 580 dollars in 39 hours and 25 minutes, using 20 Amazon EC2 machine instances.
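
As a rough check on what these figures imply, the sketch below derives throughput and unit cost from the numbers quoted above. It assumes the load was spread evenly over the 20 instances, which the excerpt does not state, and it treats 580 dollars as an upper bound because the excerpt says "just under". The function and field names are illustrative and do not come from the original post.

```python
def crawl_stats(pages, hours, minutes, instances, cost_usd):
    """Derive throughput and unit cost from the headline crawl figures."""
    seconds = hours * 3600 + minutes * 60
    pages_per_second = pages / seconds
    return {
        "seconds": seconds,
        "pages_per_second": pages_per_second,
        # Assumes an even split across instances (not stated in the excerpt).
        "pages_per_second_per_instance": pages_per_second / instances,
        # Upper bound: the excerpt says "just under 580 dollars".
        "usd_per_million_pages": cost_usd / (pages / 1_000_000),
    }


# Figures quoted in the excerpt: 250,113,669 pages, 39 h 25 min, 20 instances.
stats = crawl_stats(250_113_669, 39, 25, 20, 580)
for name, value in stats.items():
    print(f"{name}: {value:,.2f}")
```

Under those assumptions the crawl averaged roughly 1,760 pages per second overall, or about 88 per instance, at no more than about 2.32 dollars per million pages.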


Datomic: Distributed Database Designed to Enable Scalable...

I’m just starting to read about Datomic, a distributed database designed to enable scalable, flexible and intelligent applications, running on next-generation cloud architectures. Skimming through the...

Indexing Files via Solr and Java MapReduce

Several weeks ago, I set about to demonstrate the ease with which Solr and Map/Reduce can be integrated. I was unable to find a simple, yet comprehensive, primer on integrating the two technologies. So I set about to write one.


What follows is my bare-bones tutorial on getting Solr up and running to index each word of the complete works of Shakespeare. Note: Special thanks to Sematext for looking over the Solr bits and making sure they are sane. Check them out if you’re going to be doing a lot of work with Solr, ElasticSearch, or search in general and want to bring in the experts.


Inside Affinity: An Introduction to VMware's Open Source Database Affinity - NoSQLFan - Following NoSQL Technology and News

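Affinity is a new kind of open source database system that VMware released on February 28 of this year. Its design draws on the strengths of relational, object-oriented, document, RDF/XML, and other database systems, and it is flexible, easy to use, and rich in interfaces. Starting today, I will publish a series of blog posts sharing what I know about the Affinity database, such as its features, queries, and usage. Today's post introduces the team, history, and features of the Affinity database.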


mrcc: A Distributed C Compiler Based on MapReduce - NoSQLFan - Following NoSQL Technology and News

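These days everything is distributed: distributed storage, distributed computing. What follows introduces a distributed C compiler, mrcc, which parallelizes compilation using the MapReduce model. It sounds completely crazy, but that is really what it does.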
