Data Flow Languages & Apache Pig - BD Analytics
Overview
General Data Model for Data Flow Languages
Data Flow Programming Paradigm
Pipe Diagrams
Apache Pig [60, 61, 62]
Data Model for Apache Pig [62]
2 Pig Latin
Scripting Language Pig Latin [62]
Relational Operators [62]
Non-relational Operators [62]
3 Accessing Data
Accessing and Manipulating Data with Pig
Debugging [62]
Pig Examples: Our Student/Lecture Example
Preprocessor [67]
Embedding Pig into Python [62]
Writing UDFs in Python [62]
4 Architecture
File Formats
Execution of Pig Queries on MapReduce and Tez
Performance Advice and Parallelism [62]
Optimization of Joins [62]
Summary