Bringing Big Data to the Masses
A true leader does not follow trends; he initiates them. Dj Das, CEO of Third Eye Consulting Services and Solutions, personifies this fact. Armed with strong foresight that Big Data technologies would be the next [...]
Amazon Redshift System Overview
Amazon Redshift is an enterprise-level, petabyte-scale, fully managed data warehousing service. Topics: Data Warehouse System Architecture; Performance; Columnar Storage; Internal Architecture and System Operation; Workload Management; Using Amazon Redshift [...]
MapReduce Tutorial: Introduction
In this MapReduce Tutorial blog, I am going to introduce you to MapReduce, which is one of the core building blocks of processing in the Hadoop framework. Before moving ahead, I would [...]
TensorFlow Architecture
We designed TensorFlow for large-scale distributed training and inference, but it is also flexible enough to support experimentation with new machine learning models and system-level optimizations. This document describes the system architecture [...]
Introduction
The Apache TEZ® project aims to build an application framework that allows a complex directed acyclic graph of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 main design [...]
OVERVIEW
A system for processing streaming data in real time
Apache™ Storm adds reliable real-time data processing capabilities to Enterprise Hadoop. Storm on YARN is powerful for scenarios requiring real-time analytics, machine learning and [...]