Defining Hadoop: the Players, Technologies and Challenges of 2011

Table of Contents

  1. Summary
  2. About Derrick Harris
  3. Introduction – Apache Hadoop
    1. What Hadoop Is (and Is Not)
  4. The Hadoop Ecosystem
    1. The Distributions
    2. Other Hadoop-Based Products
    3. ISVs Supporting Hadoop
    4. Hadoop Use Cases
      1. Who’s Using Hadoop
      2. Specific Use Cases
      3. State of Deployment (Research or Production)
      4. Deployment Size
  5. Challenges
  6. Outlook
    1. New Technologies
    2. New Opportunities
  7. Further Reading

1. Summary

The word “Hadoop” almost inevitably comes up during any discussion about big data or next-generation analytics strategies, but there is still a fair amount of confusion about what Hadoop actually is and for what types of workloads it is best used. Most people familiar with Hadoop at least know that it, and Google MapReduce, on which it was based, have been used at massive scale by large web companies for applications such as search engines. In reality, Hadoop is much more. This report takes a closer look at that reality, examining what Hadoop is (and isn’t), who’s doing what to productize it and why we can expect to see the market pick up serious steam in 2011.

Hadoop can be used for a wide variety of data-processing workloads, some of which are broadly applicable across any industry where unstructured data volumes are proliferating. Some use cases are even pushing Hadoop beyond its natural batch-processing sweet spot. Further, it is being used by a growing number of companies across many industries — including some in the Fortune 100 — and a growing number of commercial software vendors are selling products designed to make Hadoop easier to use for mainstream customers. Probably the most famous is Cloudera, which gives away its own enterprise-hardened distribution of the core Apache Hadoop project but also provides support, services and advanced management tools.

For all this advancement, though, Hadoop still has a long way to go before it becomes as widespread as its hype suggests. One big problem is that, despite the proliferation of tools designed to simplify the process, it still is not always easy for inexperienced developers to create Hadoop applications and workflows. The result is that, as more organizations try to hire personnel to start or grow their Hadoop deployments, qualified people are hard to find.

Hadoop will continue its trek to mainstream adoption in 2011. Companies of all types and in all geographic regions likely will be advancing or beginning their big data efforts in the coming months and years, and many will benefit from some hand-holding and technological assistance into this brave new world.
