- Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation.
Hadoop was originally conceived on the basis of Google's MapReduce, in which an application is broken down into numerous small parts. Any of these parts (also called
fragments or blocks) can be run on any node in the cluster. Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. A distributed file system (DFS) facilitates rapid data transfer rates among nodes and allows the system to continue operating uninterrupted in case of a node failure. The risk of catastrophic system failure is low, even if a significant number of nodes become inoperative.
The Hadoop framework is used by major players including Google, Yahoo and IBM, largely for applications involving search engines and advertising. The preferred operating systems are Windows and Linux but Hadoop can also work with BSD and OS X. Hadoop was originally the name of a stuffed toy elephant belonging to a child of the framework's creator, Doug Cutting.
Learn More About IT:
Visit the Hadoop Core Web site.
The Apache Software Foundation answers frequently asked questions about Hadoop.
Tim O'Reilly discusses Yahoo's support of Hadoop.
 |
Learn more about Cloud computing services |
| Cloud Computing e-book: This e-book examines the case for moving to cloud computing as well as issues involved with using the cloud as an application development platform, test bed and storage solution. |
| Amazon Web Services product directory: Discover the products, from Amazon and third-party companies such as CA, IBM and Oracle, available in the cloud to Amazon Web Services customers. |
| Cloud computing programming API tutorial: In this tutorial, we will dive into the more interesting sites with resources and tips on development. Find out about the trends for APIs in emerging cloud computing architectures. |
| Protocols for cloud services - Part 2: This article takes a look at JavaSpace and Hadoop, two different technologies that distribute computing jobs to cloud computing resources, manage it, and get results back. |
| Getting started with Amazon EC2: With all the hype surrounding Amazon's EC2 and Web Services, it's hard to find independent advice on how to start out. Here is a primer on stepping into the cloud. |
| LAST UPDATED: |
15 May 2009
|
 |
Do you have something to add to this definition? Let us know.
Send your comments to techterms@whatis.com
|


');
// -->


 |
 |
|  |
RELATED GLOSSARY TERMS
| Terms from Whatis.com − the technology online dictionary |
 |
Amazon Elastic Compute Cloud
(SearchCloudComputing.com)
The Amazon Elastic Compute Cloud (Amazon EC2) is a Web-based service that allows business subscribers to run application programs in the Amazon.com...
|
 |
Blue Cloud
(SearchCloudComputing.com)
Blue Cloud is an approach to shared infrastructure developed by IBM. The goal of IBM's Blue Cloud is to provide services that automate fluctuating...
|
|

|