spark

Home » spark

Data Normalization with Spark

By | 2019-01-02T07:34:58+00:00 November 27th, 2018|Pranab Ghosh|

Data Normalization with Spark Data normalization is a required data preparation step for many Machine Learning algorithms. These algorithms are sensitive to the relative values of the feature attributes. Data normalization is the process of bringing all the [...]

Azure Data Factory

By | 2019-01-31T10:14:45+00:00 February 2nd, 2018|Azure Data Factory, Technologies|

Azure Data Factory In the world of big data, raw, unorganized data is often stored in relational, non-relational, and other storage systems. However, on its own, raw data doesn't have the proper context or [...]

Azure HDInsight

By | 2018-08-01T11:30:28+00:00 February 2nd, 2018|Azure HDInsight, Technologies|

Azure HDInsight is a fully managed, full-spectrum, open-source analytics service for enterprises. HDInsight is a cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. HDInsight also supports a [...]

Apache Spark 

By | 2018-08-01T13:02:14+00:00 January 30th, 2018|Apache Spark, Technologies|

Apache Spark is a fast and general engine for large-scale data processing. Speed Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark has an advanced DAG [...]

Load More Posts
CONTACT US