pig tutorial - apache pig tutorial - Apache Pig - REPLACE() - pig latin - apache pig - pig hadoop
What is REPLACE()?
- REPLACE() function is used to replace all the characters in a given string with the new characters.
Syntax:
Given below is the syntax of the REPLACE() function. This function accepts three parameters, namely,
- string − The string that is to be replaced. If we want to replace the string within a relation, we have to pass the column name the string belongs to.
- regEXP − Here we have to pass the string/regular expression we want to replace.
- newChar − Here we have to pass the new value of the string.
Example:
Assume that there is a file named wikitechy_emp.txt in the HDFS directory /pig_data/as shown below. This file contains the employee details such as id, name, age, and city.
wikitechy_emp.txt
And, we have loaded this file into Pig with a relation named wikitechy_emp_data as shown below.
Following is an example of the REPLACE() function. In this example, we have replaced the name of the city Bhuwaneshwar with a shorter form Bhuw.
The above statement replaces the string 'Bhuwaneshwar' with 'Bhuw' in the column named city in the wikitechy_emp_data relation and returns the result. This result is stored in the relation named replace_data. Verify the content of the relation replace_data using the Dump operator as shown below.