Serde in hive
- Apache Hive uses Serde to read and write data from tables.
- A SerDe is a short name for a Serializer Deserializer.
- An important concept behind Hive is that it DOES NOT own the Hadoop File System (HDFS) format that data is stored in.
- Apache Hive SerDe.deserialize() used to select the data;when method is called and inserting the data SerDe.serialize() method is called.
- It uses the SerDe interface for IO.
- In SerDe interface handles both serialization and deserialization and also interpreting the results of serialization as individual fields for processing.
- It allows Hive to read data from a table, and write it back to HDFS in any format, user can write data formats.