pig tutorial - apache pig tutorial - Apache Pig TOKENIZE() Function - pig latin - apache pig - pig hadoop

The TOKENIZE() function used in Apache Pig is used to split a string in a single tuple and returns a bag which contains the output of the split operation.
The TOKENIZE() function is used to break an input string into tokens separated by a regular expression pattern.
The TOKENIZE() function is when the Token elements are placed under the element
The TOKENIZE() function will returns one token element, which contains the input string.
The TOKENIZE() function has each substring value which is found between the separator matches is placed inside elements with the name token and the namespace mhub

Syntax

grunt> TOKENIZE(expression [, 'field_delimiter']) 

Example

wikitechy_student_details.txt

111,Suresh Reddy,21,Hyderabad
112,Arvin Battacharya,22,Kolkata 
113,Ramesh Khanna,22,Delhi 
114,Preethi Agarwal,21,Pune 
115,Sruthi Mohanthy,23,Bhuwaneshwar 
116,Vanitha Mishra,23 ,Chennai 
117,Kamala Nayak,24,trivendram 
118,Bhargavi Nambiayar,24,Chennai

We have loaded the file into Pig with the relation name wikitechy_student_details which is given below:

grunt> wikitechy_student_details = LOAD 'hdfs://localhost:9000/pig_data/wikitechy_student_details.txt' USING PigStorage(',')
as (id:int, name:chararray, age:int,  city:chararray);

Tokenizing a String

We can use the TOKENIZE() function to split into a string.

grunt> student_name_tokenize = foreach wikitechy_student_details Generate TOKENIZE(name);

Verification

grunt> Dump student_name_tokenize;

Output

({(Suresh),(Reddy)})
({(Arvin),(Battacharya)})
({(Ramesh),(Khanna)})
({(Preethi),(Agarwal)})
({(Sruthi),(Mohanthy)})
({(Vanitha),(Mishra)})
({(Kamala),(Nayak)})
({(Bhargavi),(Nambiayar)})

Related Searches to Apache Pig TOKENIZE() Function

apache pig tokenize function pig tokenize delimiter flatten in pig strsplit pig example pig split string example textloader in pig word count in pig pig strsplit flatten pig string length pig tokenizer pig substring pig replace pig tokenize example pig tutorial apache pig tutorial hadoop pig tutorial pig latin tutorial learn pig pig hadoop pig tutorial point learn pig latin pig big data pig latin hadoop apache pig pig latin pig commands pig hive pig interview questions hadoop pig hive pig script how to learn pig latin pig and hive pig language pig tutorial pdf apache pig tutorial pdf hadoop pig examples pig store pig programming apache pig download pig data pig script example pig group pig storage pig in latin pig order what is apache pig how to read pig latin pig flatten pigstorage flatten in pig pig latin examples pig mapreduce apache pig commands pig commands pdf pig examples pig load pig code guide pig pig jobs store command in pig tutorial peppa pig peppa pig tutorial simple pig how to write in pig latin datapig pig latin program uses of pig

pig tutorial - apache pig tutorial - Apache Pig TOKENIZE() Function - pig latin - apache pig - pig hadoop

What is TOKENIZE() function in Apache Pig ?

Syntax

Example

Tokenizing a String

Verification

Output

Related Searches to Apache Pig TOKENIZE() Function

Wikitechy

Workshop

Join our Community

Other Languages

pig tutorial - apache pig tutorial - Apache Pig TOKENIZE() Function - pig latin - apache pig - pig hadoop

What is TOKENIZE() function in Apache Pig ?

Syntax

Example

Tokenizing a String

Verification

Output

Related Searches to Apache Pig TOKENIZE() Function

Summer Offline Internship

Summer Online Internship

Internship in Chennai

Programming / Technology Internship in Chennai

Wikitechy

Workshop

Join our Community

Other Languages