[Solved-2 Solutions] Generating an id/counter for foreach in pig latin ?
What is Foreach ?
- The FOREACH operator is used to generate specified data transformations based on the column data.
Syntax
- Here is the syntax of FOREACH operator
Problem:
- If you want some sort of unique identifier/line_number/counter to be generated/appended in foreach construct while iterates through the records. Is there a way to accomplish this without writing a UDF?
How to get that 'a_unique_id' implemented?
Solution 1 :
- If we care using pig 0.11 or later then the RANK is exactly what you are looking for.
Here is an example
Solution 2:
- There is no built-in UUID function in the main Pig distribution or piggybank. Unfortunately, we think your only option is going to be writing a UDF.