Top Hadoop Myths To Know

There’s a lot of confusion about Hadoop and where it fits into the overall Big Data landscape.

There’s a lot of confusion about Hadoop and where it fits into the overall Big Data landscape. Datamation takes a look at this open source software framework that stores and analyzes large data sets distributed across multiple servers–and dispels six big Hadoop myths.

Myth #1: Hadoop is a database

Hadoop is often talked about like it’s a database, but it isn’t. “There s nothing in the core Hadoop platform like a query or an index,” said Marshall Bockrath-Vandegrift, a software engineer with Damballa, a security company. Damballa uses Hadoop to analyze real-time security threats.

Myth #2: Hadoop Will Require Programmers

Depending on what you plan to do, this myth may come true. If you plan to build the next great Hadoop-based Big Data suite, you’ll need programmers who can write in Java and understand specialized MapReduce programming. However, if you’re content to build on the work of others, programming shouldn’t scare you off. Most data integration tools will have GUIs that abstract MapReduce programming complexity, and many come with pre-built templates.

Myth #3: Using Hadoop is Cheap

This is a common misconception associated with anything open source. Just because you’re able to reduce or eliminate the initial costs of purchasing software doesn’t mean you’ll necessarily save money. One of the problems with the cloud, for instance, is that it’s so easy to run a science project on Amazon that developers of all sorts throw projects up in AWS, forget about them, but keep paying for them.

Learn more in Six Hadoop Myths Exposed on Datamation.

This article was originally published on August 02, 2013

Webopedia Staff
Webopedia Staff
Since 1995, more than 100 tech experts and researchers have kept Webopedia’s definitions, articles, and study guides up to date. For more information on current editorial staff, please visit our About page.
Get the Free Newsletter
Subscribe to Daily Tech Insider for top news, trends & analysis
This email address is invalid.
Get the Free Newsletter
Subscribe to Daily Tech Insider for top news, trends & analysis
This email address is invalid.

Related Articles

Virtual Private Network (VPN)

A virtual private network (VPN) encrypts a device's Internet access through a secure server. It is most frequently used for remote employees accessing a...

Gantt Chart

A Gantt chart is a type of bar chart that illustrates a project schedule and shows the dependency between tasks and the current schedule...

Input Sanitization

Input sanitization is a cybersecurity measure of checking, cleaning, and filtering data inputs from users, APIs, and web services of any unwanted characters and...

IT Asset Management Software

IT asset management software (ITAM software) is an application for organizing, recording, and tracking all of an organization s hardware and software assets throughout...

Accenture

Accenture is a global professional services company that specializes in information technology (IT)...

Gartner

Gartner is a world-renowned information technology (IT) consultancy and advisory firm that conducts...

Pipedrive

Pipedrive is customer relationship management (CRM) software designed for sales. The software focuses...