Apache Hive


Azure HDInsight

2018-08-01T11:30:28+00:00 | February 2nd, 2018 | Azure HDInsight, Technologies

Azure HDInsight is a fully managed, full-spectrum, open-source analytics service for enterprises. HDInsight is a cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. HDInsight also supports a [...]

Amazon Athena

2019-01-31T10:32:25+00:00 | February 1st, 2018 | Amazon Athena, Technologies

What is Amazon Athena? Amazon Athena is an interactive query service that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3) using standard SQL. With a few actions in [...]

Amazon EMR

2019-06-19T11:32:56+00:00 | February 1st, 2018 | Technologies, Uncategorized

What Is Amazon EMR? Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data. By using these [...]

Apache Tez

2019-06-19T11:51:13+00:00 | February 1st, 2018 | Apache Tez, Technologies

Introduction: The Apache TEZ® project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 main design [...]

Apache Drill

2018-08-09T06:21:32+00:00 | February 1st, 2018 | Apache Drill, Technologies

Apache Drill: Drill is an Apache open-source SQL query engine for Big Data exploration. Apache Drill is designed from the ground up to support high-performance analysis on the semi-structured and rapidly evolving data coming from modern [...]

Spark SQL 

2019-06-20T13:02:47+00:00 | January 31st, 2018 | SparkSQL, Technologies

Apache Spark is a lightning-fast cluster computing framework designed for fast computation. It is one of the most successful projects in the Apache Software Foundation. Spark SQL is a new module in Spark which integrates relational processing with [...]

Apache Hive 

2018-08-01T12:52:59+00:00 | January 31st, 2018 | Apache Hive, Technologies

The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following features: Tools to enable easy [...]

Cloudera Impala 

2018-08-01T12:58:52+00:00 | January 31st, 2018 | Cloudera Impala, Technologies

Cloudera Impala provides fast, interactive SQL queries directly on your Apache Hadoop data stored in HDFS or HBase. In addition to using the same unified storage platform, Impala also uses the same metadata, SQL [...]

Apache Spark 

2018-08-01T13:02:14+00:00 | January 30th, 2018 | Apache Spark, Technologies

Apache Spark is a fast and general engine for large-scale data processing. Speed: Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark has an advanced DAG [...]
