Advantages of Pig in Hadoop
-
- Development time is reduced compared with writing MapReduce jobs by hand.
- It provides a rich set of operators for operations such as join, filter, sort, and many more.
- Pig Latin is similar to SQL, so developers who know SQL find it easy to write a Pig script.
- Apache Pig optimizes task execution automatically, so programmers need to focus only on the semantics of the language rather than on efficiency.
- Using the existing operators, users can easily develop their own functions to read, process, and write data.
- Pig provides the facility for creating User Defined Functions (UDFs): we can write them in a number of programming languages, such as Java, and invoke them in Pig scripts.
- Apache Pig handles all types of data, both structured and unstructured, and stores the results in HDFS.
- If you have UDFs that you want to parallelize and apply to large amounts of data, you are in luck: use Pig as the base pipeline, where it does the hard work and you just apply your UDF in the step where you need it.
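
To make the UDF points above concrete, here is a minimal sketch of a UDF written in Python, one of the languages Pig accepts besides Java. The file name `my_udfs.py`, the function `normalize`, and the paths and aliases in the commented Pig Latin are all hypothetical; the `pig_util` import assumes the script is registered through `streaming_python`, and the fallback decorator exists only so the function can be tried locally without Pig.

```python
# Sketch of a Pig UDF in Python (hypothetical file name: my_udfs.py).
try:
    # Assumed to be available when Pig runs this file via streaming_python.
    from pig_util import outputSchema
except ImportError:
    # Local stand-in so the function can be exercised without Pig installed.
    def outputSchema(schema):
        def wrap(func):
            func.outputSchema = schema
            return func
        return wrap


@outputSchema("word:chararray")
def normalize(word):
    """Trim whitespace and lower-case one field; pass nulls through."""
    if word is None:
        return None
    return word.strip().lower()


# Hypothetical Pig Latin usage (paths and aliases are placeholders):
#   REGISTER 'my_udfs.py' USING streaming_python AS my_udfs;
#   raw   = LOAD 'input/words.txt' AS (word:chararray);
#   clean = FOREACH raw GENERATE my_udfs.normalize(word);
#   STORE clean INTO 'output/clean_words';
```

In a pipeline like the commented one, Pig takes care of loading, distributing, and storing the data, and the UDF is applied only in the `FOREACH` step.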