What are the steps to deploy a big data solution?
- Deploying a big data solution involves three steps: data ingestion, data storage, and data processing.
![Deploying Big Data solution](https://cdn.wikitechy.com/interview-questions/big-data/deploying-big-data-solution.png)
Data Ingestion
- The first step in deploying a big data solution is data ingestion, i.e. extracting data from various sources.
- The source may be a CRM such as Salesforce, an Enterprise Resource Planning (ERP) system such as SAP, an RDBMS such as MySQL, or log files, documents, social media feeds, and so on.
- Data can be ingested through batch jobs or through real-time streaming. The extracted data is then stored in HDFS.
![Data Ingestion in Big Data](https://cdn.wikitechy.com/interview-questions/big-data/data-ingestion-in-big-data-solution.png)
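To make the difference between batch and streaming ingestion concrete, here is a minimal sketch in plain Python. It is only an illustration: a temporary local directory stands in for HDFS, and the names `batch_ingest`, `stream_ingest`, `demo`, and the sample records are hypothetical, not part of any real ingestion tool.

```python
import json
import tempfile
from pathlib import Path
from typing import Iterable, Iterator, Tuple


def batch_ingest(records: Iterable[dict], landing_dir: Path) -> Path:
    """Batch style: write every record from one extraction run into one file."""
    landing_dir.mkdir(parents=True, exist_ok=True)
    out = landing_dir / "batch-0001.jsonl"  # hypothetical file name
    with out.open("w", encoding="utf-8") as fh:
        for rec in records:
            fh.write(json.dumps(rec) + "\n")
    return out


def stream_ingest(events: Iterator[dict], landing_dir: Path) -> int:
    """Streaming style: append each event as it arrives; return the count."""
    landing_dir.mkdir(parents=True, exist_ok=True)
    out = landing_dir / "stream.jsonl"  # hypothetical file name
    count = 0
    with out.open("a", encoding="utf-8") as fh:
        for event in events:
            fh.write(json.dumps(event) + "\n")
            fh.flush()  # make the event visible without waiting for the end
            count += 1
    return count


def demo() -> Tuple[int, int]:
    # Made-up sample data standing in for a CRM export and a log feed.
    crm_rows = [{"source": "crm", "id": 1}, {"source": "crm", "id": 2}]
    log_events = iter([{"source": "log", "msg": "login"}])
    with tempfile.TemporaryDirectory() as tmp:
        landing = Path(tmp) / "landing"  # local stand-in for an HDFS directory
        batch_file = batch_ingest(crm_rows, landing)
        batch_count = len(batch_file.read_text(encoding="utf-8").splitlines())
        stream_count = stream_ingest(log_events, landing)
    return batch_count, stream_count
```

Calling `demo()` writes two CRM rows in one batch and one log event as a stream, then returns how many records each path stored. In a real deployment the batch path would typically run on a schedule, while the streaming path would read from a long-lived event source.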
Data Storage
- The second step is storing the ingested data, for example in HDFS.
![Data Storage](https://cdn.wikitechy.com/interview-questions/big-data/data-storage-in-big-data-solution.gif)
Data Processing
- The final step in deploying a big data solution is data processing.
- The data is processed with a processing framework such as Spark, MapReduce, or Pig.
![Data Processing in Big Data](https://cdn.wikitechy.com/interview-questions/big-data/data-processing-in-big-data.gif)
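As a rough illustration of what a framework like MapReduce does at the processing step, the sketch below imitates the map, shuffle, and reduce phases for a word count in plain Python. It runs in a single process and does not use Hadoop or Spark; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by their key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: collapse the values of one key into a single total."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())
```

For example, `word_count(["big data", "Big deal"])` counts "big" twice and "data" and "deal" once each. A real framework distributes the map and reduce calls across many machines; the phase structure is the part this sketch is meant to show.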