Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?

Start networking and exchanging professional insights

Next Question:
In list of invoice data - columns are date; customer, value, value paid; data is...
Posted by:
Bonnie Cheryl
20-January-2015

Question added by khageswar rao Battala , Software Engineer , Proven Technologies and Services Pvt Ltd
Date Posted: 2016/07/08

Upvote (1) Views (35) Followers (11) Answers (9) Report Question

Write an Answer
Register now or log in to answer.

9 Answers

by Amanul Islam Khan , Programmer Analyst , Cognizant Tech Solutions

Hash Partion is the default partioner in hadoop which is handled by Hadoop internally if no partioner has been defined.

Upvote (0)

Downvote Reply () Report

by ABHISHEK AMGOTH

Hash partitioning is Default partition done

Upvote (0)

Downvote Reply () Report

by Shamrooque R P , Apps Systems Engineer 6 , wells fargo

Hash partitioning which is the dafault partitioning in hadoop.

Upvote (0)

Downvote Reply () Report

by Ravindra Singh , Data Engineer , Confidential

Default Partitioner which buckets keys using a hash function is used.

Upvote (0)

Downvote Reply () Report

by Sneha Nair , Hadoop Developer , Techdata Solutions

Hashing would be used by default else write code to make it as key and set it in mapper class

Upvote (0)

Downvote Reply () Report

by Deleted user

By default Hash partioner is used for partioning the data

Upvote (0)

Downvote Reply () Report

by Reshmi KC , Assistant System Engineer , Tata Consultancy Services

add the partition column as the key for the mapper

Upvote (0)

Downvote Reply () Report

by Sabarinath Dhandapani

If there is no custom partitioner ,mapreduce by default uses hash algorithm (hash code for map key) and keys with same hashcode will send to same reducer.

Upvote (0)

Downvote Reply () Report

by Ahmed Gamil , Senior System Engineer , General Authority of Civil Aviation

Map output stored in memory then spilled to the disk when it reach to the buffer threshold

the spill files are merged into a single partitioned and sorted output file

the maximum number of streams to merge at once is 10

Upvote (0)

Downvote Reply () Report

More Questions Like This

How can you explain or respond to a customer that do not speak or understand English?
Top Answer: The most important is understanding, if the number of customers are more look for their dialect speaking ... See More

Answers (18)
How can we reduce customers' accounts?
Top Answer: What i can understand from your is reducing the Debtors Balance (if yes) good credit control policy and a ... See More

Answers (1)
Can you explain the importance of obtaining customer due diligence?
Top Answer: to comply with regulations and to make sure that the customers have a clean records and have not involved ... See More

Answers (2)
I am a sales and marketing,customer service representative looking for a job in dubai
Top Answer: Dear Rachel, To find job i ... See More

Answers (71)

Do you need help in adding the right keywords to your CV? Let our CV writing experts help you.

Get Help

Products By Bayt.com

Use Our Mobile App

Start networking and exchanging professional insights

Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?

Popular Searches

More Questions Like This