Hadoop, Java Combo Top of Mind for Hiring Managers

By Darryl K. Taft  |  Posted 2013-05-30 Print this article Print

Big data has opened the door to new job opportunities. An online search for Hadoop jobs will bring up a string of positions available for data scientists, Hadoop engineers, data analysts, big data engineers, big data testers, big data architects, business intelligence engineers and more. Hadoop, which allows for the distributed processing of large data sets across clusters of computers using simple programming models, is red-hot right now and has become the de facto standard for storing, processing and analyzing hundreds of terabytes, and even petabytes of data. Hadoop can handle all types of data from disparate systems: structured, unstructured, log files, pictures, audio files, communications records and email, among other things, regardless of its native format. It has major cost advantages as it allows the use of commodity hardware. A look inside Dice's Open Web, which aggregates public social and professional networks from across the Web and puts them into a "super" profile for recruiters, shows the skills hiring managers search for in combination with Hadoop. In a report, Howard Lee, chief architect of Dice's Open Web, wrote, "The number one combination by a large margin—Hadoop and Java—[is] not too surprising, given it’s a Java-based framework." Here are the top 10 skills hiring managers seek in combination with Hadoop.

  • Hadoop, Java Combo Top of Mind for Hiring Managers

    by Darryl K. Taft
    1 - Hadoop, Java Combo Top of Mind for Hiring Managers
  • Java Takes the Lead

    Java is the leading skill hiring managers look for in combination with Hadoop, according to Dice's Open Web. Java remains immensely popular and is the second most popular language in the TIOBE Programming Community Index for May 2013. Hadoop and the Hadoop Distributed File System (HDFS) are written in Java.
    2 - Java Takes the Lead
  • Hadoop Developers Needed

    Dice’s Open Web identifies "developer" as the second most-sought-after skill in addition to Hadoop. Hadoop developer training also is in high demand. For instance, Cloudera offers a popular training course on Hadoop development.
    3 - Hadoop Developers Needed
  • NoSQL Plus Hadoop Equals Good Money

    Dice’s Open Web ranks NoSQL skills as the third most-sought-after skill set in addition to Hadoop. Professionals with Hadoop and NoSQL experience pulled in more than $100,000 a year on average.
    4 - NoSQL Plus Hadoop Equals Good Money
  • MapReduce Skills Make a Difference

    MapReduce skills are the fourth most-sought-after in addition to Hadoop by hiring managers, according to Dice. MapReduce, initially developed at Google, is a programming model for processing large data sets with a parallel, distributed algorithm on a cluster. Hadoop is a popular free implementation of the MapReduce model.
    5 - MapReduce Skills Make a Difference
  • Big Data Catches All

    The catchall term of big data skills came in fifth as the most popular among hiring managers in addition to Hadoop. Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis and visualization.
    6 - Big Data Catches All
  • Pig Grabs Sixth Slot

    Demand for professionals experienced with the Pig platform ranked sixth among hiring managers looking for Hadoop experts. Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with the infrastructure for evaluating these programs. Pig's structure is amenable to substantial parallelization, which helps in handling very large data sets. Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs.
    7 - Pig Grabs Sixth Slot
  • Linux and Hadoop

    The seventh most-sought-after skill in combination with Hadoop was Linux. Linux is the foundation of many big data efforts, including those at SAP HANA, Hortonworks, WANDisco, Teradata and IBM’s Watson to name a few.
    8 - Linux and Hadoop
  • Python Slithers Into Eighth

    The combination of Python programming skills and Hadoop is the eighth most popular among hiring managers, according to the Dice report. Python is a programming language that lets IT professionals work more quickly and integrate systems more effectively. Python supports multiple programming paradigms, including object-oriented, imperative and functional programming styles.
    9 - Python Slithers Into Eighth
  • Hive Brings SQL Along

    Coming in ninth among hiring managers was the combination of Hadoop and Hive. Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad hoc queries and the analysis of large datasets stored in Hadoop-compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time, this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.
    10 - Hive Brings SQL Along
  • Scala Makes Top 10

    The combination of Scala programming skills and Hadoop rounded out the list of the top 10 most-sought-after skill sets, according to Dice. Scala is a general-purpose programming language designed to express common programming patterns in a concise, elegant and type-safe way. It smoothly integrates features of object-oriented and functional languages, enabling Java and other programmers to be more productive.
    11 - Scala Makes Top 10

Submit a Comment

Loading Comments...
Manage your Newsletters: Login   Register My Newsletters

Thanks for your registration, follow us on our social networks to keep up-to-date
Rocket Fuel