All Forums Hadoop
ehbhaskar 2 posts Joined 06/15
10 Sep 2015
Splitsize for the Hive table in TDCH

I'm using TDCH to export a hive data into teradata table. For this I need to specify the number of mappers to my TDCH job. So, my question is "is this number of mappers option we give to TDCH job is just a hint to TDCH? or number of mappers created by TDCH will always be equal to number of mappers given in the option" (of the TDCH job)?
My assumption is that the number of mappers mainly depends on the split-size than on the given number of mappers (in the options of TDCH job) . Is my assumption correct for the TDCH jobs?.
Also, for the Hive table how is the split size defined? is that defined based on the number of rows? or it's just defined based on the size of the data (like 60MB or 120MB etc) similar to the cases like the "textfiles"?

You must sign in to leave a comment.