Pranab Ghosh

Cassandra Range Query Made Simple

By | 2019-01-02T08:56:51+00:00 November 28th, 2018|Pranab Ghosh|

In Cassandra, rows are hash partitioned by default. If you want data sorted by some attribute, Cassandra's column name sorting feature is usually exploited. If you look at the Cassandra slice range [...]

Hive Plays Well with JSON

By | 2018-11-28T11:08:04+00:00 November 28th, 2018|Pranab Ghosh|

Hive is an abstraction on Hadoop MapReduce. It provides a SQL-like interface for querying HDFS data, which accounts for most of its popularity. In Hive, table structured data [...]

Data Normalization with Spark

By | 2019-01-02T07:34:58+00:00 November 27th, 2018|Pranab Ghosh|

Data normalization is a required data preparation step for many machine learning algorithms. These algorithms are sensitive to the relative values of the feature attributes. Data normalization is the process of bringing all the [...]

Anomaly Detection with Robust Zscore

By | 2019-07-17T13:47:16+00:00 November 27th, 2018|Pranab Ghosh|

Anomaly detection techniques based on statistical modeling are simple and effective. The Zscore based technique is one among them. Zscore is defined as the absolute difference between [...]
