Advertisement

Apache Sqoop: Import using SPLIT BY and BOUNDARY QUERY

Apache Sqoop: Import using SPLIT BY and BOUNDARY QUERY --split-by : It is used to specify the column of the table used to generate splits for imports.
--boundary-query : By default sqoop will use query select min(), max() from to find out boundaries for creating splits. In some cases this query is not the most optimal so you can specify any arbitrary query returning two numeric columns using --boundary-query argument.

SQOOP,IMPORT,SPLITBY,BOUNDARY,QUERY,HDFS,HADOOP,SQOOP IMPORT,

Post a Comment

0 Comments