Sqoop vs Apache Pig
What is Apache Pig?
- Apache Pig is a high-level platform for creating programs that run on Apache Hadoop. The language used on this platform is called Pig Latin.
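To give a feel for Pig Latin, here is a minimal sketch that writes a small script to a temporary file. The input file `emp.csv` and the field names `name`, `city`, and `pin` are hypothetical and only serve as an illustration; running the script requires a Pig installation, so that step is left as a comment.

```shell
#!/bin/sh
# Hypothetical example: the file name and schema below are assumptions.
SCRIPT="$(mktemp -d)/count_by_city.pig"

cat > "$SCRIPT" <<'EOF'
-- Load comma-separated records with an explicit schema.
emp = LOAD 'emp.csv' USING PigStorage(',')
      AS (name:chararray, city:chararray, pin:int);
-- GROUP yields one tuple per city holding a bag of matching records.
by_city = GROUP emp BY city;
-- Count the records in each bag.
counts = FOREACH by_city GENERATE group AS city, COUNT(emp) AS employees;
DUMP counts;
EOF

echo "Wrote $SCRIPT"
# To execute in local mode (requires Pig on the PATH):
#   pig -x local "$SCRIPT"
```

Each statement names an intermediate relation, which is what makes Pig Latin read as a step-by-step data flow rather than a single query.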
Difference between Sqoop and Apache Pig:
Sqoop | Apache Pig |
---|---|
Sqoop is a tool for transferring large amounts of data between Hadoop and relational database servers, in both directions (import and export). | Apache Pig is a platform for analyzing large data sets. It consists of a high-level language for expressing data analysis programs and the infrastructure for evaluating those programs. |
The large object types handled by Sqoop are BLOB and CLOB. | The scalar data types in Pig include int, long, float, double, chararray, and bytearray. |
Sqoop imports and exports table data; an import can be limited to the rows matching a WHERE clause (--where). | The complex data types in Pig are map, tuple, and bag. |
Append: the --append option appends imported data to an existing dataset in HDFS. | Pig Latin: a simple programming language that is easy to write, execute, and manage. |
Example: sqoop eval --connect jdbc:mysql://db.test.com/corp --query "SELECT * FROM intellipaat_emp LIMIT 20" | Example (map): ['city'#'bang','pin'#560001] |
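The Sqoop side of the table (a WHERE-clause import combined with append) can be sketched as a single command. This is an illustration under stated assumptions, not a tested production command: the JDBC URL and table name come from the example in the table, while the user name, the `city` column in the filter, and the HDFS target directory are hypothetical. The `run` helper only prints the command by default, so the sketch can be tried without a Sqoop installation.

```shell
#!/bin/sh
# Dry-run helper: prints the command unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

JDBC_URL="jdbc:mysql://db.test.com/corp"   # from the example in the table
TABLE="intellipaat_emp"                    # from the example in the table
DB_USER="report_user"                      # hypothetical user name
TARGET_DIR="/user/hadoop/intellipaat_emp"  # hypothetical HDFS directory

# Import only the rows matching the WHERE clause and append them
# to whatever already exists in TARGET_DIR.
# The city column is an assumption about the table's schema.
run sqoop import \
  --connect "$JDBC_URL" \
  --username "$DB_USER" -P \
  --table "$TABLE" \
  --where "city = 'bang'" \
  --target-dir "$TARGET_DIR" \
  --num-mappers 1 \
  --append
```

Setting `DRY_RUN=0` executes the command for real; `-P` then prompts for the database password, and `--num-mappers 1` avoids having to choose a split column for a table whose primary key is unknown.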