[Solved-1 Solution] File formats that can be read using Pig
What is a file format?
- A file format is the layout of a file in terms of how the data within the file is organized. A program that uses the data in a file must be able to recognize and possibly access data within the file.
Problem:
- Which file formats can Pig read, and how can data be stored in a different format?
Solution 1:
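- Pig ships with a few built-in load and store functions, but the selection is limited:
- BinStorage - loads and stores data in Pig's own binary (machine-readable) format
- PigStorage - loads and stores text data whose fields are separated by a delimiter (tab by default; a comma or another character can be specified)
- TextLoader - loads text line by line (i.e., delimited by the newline character), with each line becoming a single field; it is a loader only and cannot store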
- Piggybank is a library of community-contributed user-defined functions. It adds a number of load and store functions, including an XML loader, but no XML storer.
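
The functions listed above can be combined as in the following minimal sketch. The paths, relation names, field schema, the `record` tag name, and the location of `piggybank.jar` are all hypothetical and would need to match your own data and installation.

```pig
-- Read comma-delimited text with PigStorage (hypothetical path and schema).
users = LOAD 'input/users.csv' USING PigStorage(',')
        AS (id:int, name:chararray, age:int);

-- Read raw text with TextLoader: each line becomes one chararray field.
lines = LOAD 'input/app.log' USING TextLoader() AS (line:chararray);

-- Write the same data back out in two different formats.
STORE users INTO 'output/users_tsv' USING PigStorage('\t');  -- tab-delimited text
STORE users INTO 'output/users_bin' USING BinStorage();      -- Pig binary format

-- Piggybank must be registered before its loaders can be used
-- (the jar location below is an assumption).
REGISTER /path/to/piggybank.jar;

-- XMLLoader returns each <record>...</record> element as one chararray.
docs = LOAD 'input/data.xml'
       USING org.apache.pig.piggybank.storage.XMLLoader('record')
       AS (doc:chararray);
```

Converting between formats is therefore mostly a matter of loading with one function and storing with another. Since Piggybank has no XML storer, XML loaded this way would have to be written out through one of the other store functions.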