- Developed & supported a Hadoop Data Warehousing Platform – both on-premises & in the cloud.
- Developed Data Pipelines for complex multi-source, multi-format ingestion.
- Tuned & optimized the performance of HiveQL queries needed for DW operations.
- Supported SAS-based Data Scientists in migrating to the Hadoop platform.
Technologies Used: Cloudera Hadoop, MapReduce, Hive, Pig, HBase, Impala, SASM, Amazon Redshift, S3.