January 2018

Spark SQL

By Dj Das|2022-07-04T14:47:26+00:00January 31st, 2018|Informative, SparkSQL, Technologies|

Apache Spark is a lightning-fast cluster computing framework designed for fast computation. It is of the most successful projects in the Apache Software Foundation. Spark SQL is a new module in Spark which integrates relational processing with [...]

Apache Hive

By Dj Das|2024-07-31T13:08:10+00:00January 31st, 2018|Apache Hive, Informative, Technologies|

What does Apache Hive do? The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following [...]

Apache Mahout

By Dj Das|2024-07-31T13:12:25+00:00January 31st, 2018|Apache Mahout, Informative, Technologies|

What is Apache Mahout? Apache™ Mahout is a library of scalable machine-learning algorithms, implemented on top of Apache Hadoop® and using the MapReduce paradigm. Machine learning is a discipline of artificial [...]

Apache Flink

By Dj Das|2024-07-31T13:00:33+00:00January 31st, 2018|Apache Flink, Informative, Technologies|

Introduction to Apache Flink® Below is a high-level overview of Apache Flink and stream processing. Continuous Processing for Unbounded Datasets Features: Why Flink? Flink, the streaming model, and bounded datasets The “What”: Flink from [...]

Apache ZooKeeper

By Dj Das|2022-07-04T14:33:08+00:00January 31st, 2018|Apache ZooKeeper, Informative, Technologies|

Apache ZooKeeper Apache ZooKeeper is a distributed, open-source coordination service for distributed applications. It exposes a simple set of primitives that distributed applications can build upon to implement higher level services for synchronization, configuration [...]

Cloudera Impala

By Dj Das|2022-07-04T14:21:52+00:00January 31st, 2018|Cloudera Impala, Informative, Technologies|

Cloudera Impala provides fast, interactive SQL queries directly on your Apache Hadoop data stored in HDFS or HBase. In addition to using the same unified storage platform, Impala also uses the same metadata, SQL [...]

Apache Kafka

By Dj Das|2024-07-31T13:10:27+00:00January 30th, 2018|Apache Kafka, Informative, Technologies|

Apache Kafka We think of a streaming platform as having three key capabilities: It lets you publish and subscribe to streams of records. In this respect it is similar to a message queue or [...]

Apache Flume

By Dj Das|2024-07-31T13:02:23+00:00January 30th, 2018|Apache Flume, Informative, Technologies|

What is Apache Flume? Apache Flume is a distributed, reliable, and available system for efficiently collecting, aggregating and moving large amounts of log data from many different sources to a centralized data store. The [...]

Apache Spark

By Dj Das|2024-07-31T13:18:23+00:00January 30th, 2018|Apache Spark, Informative, Technologies|

What is Apache Spark? Apache Spark is a fast and general engine for large-scale data processing. Speed Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark [...]

Apache Pig

By Dj Das|2024-07-31T13:14:14+00:00January 30th, 2018|Apache Pig, Informative, Technologies|

What is Apache Pig? Apache Pig is a high-level language platform developed to execute queries on huge datasets that are stored in HDFS using Apache Hadoop. It is similar to SQL [...]

Apache Mahout

Apache ZooKeeper

Apache Kafka

Apache Flume

Model Context Protocol

A Comparative Study Between LangGraph and LangChain for Enterprise AI Development

All About Emergent Behavior in Large Language Models

How Accounting and CA Firms are Using AI for Driving Growth

A Gala of MDM & Data Governance Use Cases: Building Responsible AI without Reckless Data – Part 3

The Rise of Patient Diagnostic Assistants in Enhancing Early Detection in Healthcare

Want quick ROI from AI projects? Try Low Code/No Code Platforms!

A Gala of MDM & Data Governance Use Cases: Building Responsible AI without Reckless Data – Part 2

A Technical and Business Perspective for Choosing the Right LLM for Enterprise Applications

A Gala of MDM & Data Governance Use Cases: Building Responsible AI without Reckless Data – Part 1

Primary Services

Pre-Built Applications

Data & AI Solutions

Get Exclusive Insights

Insights

Talk To Us