Apache Hive

Home » Apache Hive

Azure HDInsight

By | 2018-08-01T11:30:28+00:00 February 2nd, 2018|Azure HDInsight, Technologies|

Azure HDInsight is a fully managed, full-spectrum, open-source analytics service for enterprises. HDInsight is a cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. HDInsight also supports a [...]

Amazon EMR

By | 2018-08-01T12:07:12+00:00 February 1st, 2018|Technologies, Uncategorized|

What Is Amazon EMR? Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data. By using these [...]

Apache Tez

By | 2018-08-01T12:29:06+00:00 February 1st, 2018|Apache Tez, Technologies|

Introduction The Apache TEZ® project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 main design [...]

Apache Drill

By | 2018-08-09T06:21:32+00:00 February 1st, 2018|Apache Drill, Technologies|

Apache Drill: Drill is an Apache open-source SQL query engine for Big Data exploration. Apache Drill is designed from the ground up to support high-performance analysis on the semi-structured and rapidly evolving data coming from modern [...]

Spark SQL 

By | 2018-08-01T12:51:19+00:00 January 31st, 2018|SparkSQL, Technologies|

Apache Spark is a lightning-fast cluster computing framework designed for fast computation. It is of the most successful projects in the Apache Software Foundation. Spark SQL is a new module in Spark which integrates relational processing with [...]

Apache Hive 

By | 2018-08-01T12:52:59+00:00 January 31st, 2018|Apache Hive, Technologies|

The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following features: Tools to enable easy [...]

Apache Spark 

By | 2018-08-01T13:02:14+00:00 January 30th, 2018|Apache Spark, Technologies|

Apache Spark is a fast and general engine for large-scale data processing. Speed Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark has an advanced DAG [...]

Load More Posts