apache spark

Home » apache spark

Spark ML

By | 2018-08-01T07:25:18+00:00 February 6th, 2018|Spark ML, Technologies|

Spark.ml is a new package introduced in Spark 1.2, which aims to provide a uniform set of high-level APIs that help users create and tune practical machine learning pipelines. It is currently an alpha [...]

Azure Machine Learning

By | 2018-08-01T10:39:23+00:00 February 4th, 2018|Apache Spark, Artificial Intelligence, AWS Marketplace, Azure Machine Learning, Big Data, Blogs, Data, Data Sciences, Intelligent Analytical System, Technologies|

Azure Machine Learning is an integrated, end-to-end data science and advanced analytics solution. It enables data scientists to prepare data, develop experiments, and deploy models at cloud scale. The main components of Azure Machine [...]

Azure HDInsight

By | 2018-08-01T11:30:28+00:00 February 2nd, 2018|Azure HDInsight, Technologies|

Azure HDInsight is a fully managed, full-spectrum, open-source analytics service for enterprises. HDInsight is a cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. HDInsight also supports a [...]

Amazon EMR

By | 2018-08-01T12:07:12+00:00 February 1st, 2018|Technologies, Uncategorized|

What Is Amazon EMR? Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data. By using these [...]

Spark SQL 

By | 2018-08-01T12:51:19+00:00 January 31st, 2018|SparkSQL, Technologies|

Apache Spark is a lightning-fast cluster computing framework designed for fast computation. It is of the most successful projects in the Apache Software Foundation. Spark SQL is a new module in Spark which integrates relational processing with [...]

Apache Hive 

By | 2018-08-01T12:52:59+00:00 January 31st, 2018|Apache Hive, Technologies|

The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following features: Tools to enable easy [...]

Apache Flink

By | 2018-08-01T12:55:45+00:00 January 31st, 2018|Apache Flink, Technologies|

Introduction to Apache Flink® Below is a high-level overview of Apache Flink and stream processing. Continuous Processing for Unbounded Datasets Features: Why Flink? Flink, the streaming model, and bounded datasets The “What”: Flink from [...]

Load More Posts