Spark SQL 

By |2022-07-04T14:47:26+00:00January 31st, 2018|Informative, SparkSQL, Technologies|

Apache Spark is a lightning-fast cluster computing framework designed for fast computation. It is of the most successful projects in the Apache Software Foundation. Spark SQL is a new module in Spark which integrates relational processing with [...]

Apache Hive 

By |2022-07-04T13:41:40+00:00January 31st, 2018|Apache Hive, Informative, Technologies|

The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following features: Tools to enable easy [...]

Apache Pig

By |2022-07-04T13:49:15+00:00January 30th, 2018|Apache Pig, Informative, Technologies|

Apache Pig is a high-level language platform developed to execute queries on huge datasets that are stored in HDFS using Apache Hadoop. It is similar to SQL query language but applied [...]

CONTACT US