pig

Azure HDInsight

2018-08-01T11:30:28+00:00 | February 2nd, 2018 | Azure HDInsight, Technologies

Azure HDInsight is a fully managed, full-spectrum, open-source analytics service for enterprises. HDInsight is a cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. HDInsight also supports a [...]

Apache Tez

2018-08-01T12:29:06+00:00 | February 1st, 2018 | Apache Tez, Technologies

The Apache TEZ® project aims to build an application framework that allows a complex directed acyclic graph (DAG) of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 main design [...]

Apache Hive

2018-08-01T12:52:59+00:00 | January 31st, 2018 | Apache Hive, Technologies

The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets that reside in distributed storage and are queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following features: Tools to enable easy [...]

Apache Pig

2018-08-01T13:09:25+00:00 | January 30th, 2018 | Apache Pig, Technologies

Apache Pig is a high-level platform for executing queries on large datasets stored in HDFS, using Apache Hadoop. It is similar to the SQL query language but applied on a larger [...]

Apache Hadoop

2018-08-01T13:10:56+00:00 | January 29th, 2018 | Apache Hadoop, Technologies

The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers [...]