Sahil Pathak

Sahil Pathak

Service Delivery Analyst

Followers of Sahil Pathak1000 followers
location of Sahil PathakDublin, County Dublin, Ireland

Connect with Sahil Pathak to Send Message

Connect

Connect with Sahil Pathak to Send Message

Connect
  • Timeline

  • About me

    Data Engineer & Analyst : Stamp 1G | 6.5 yrs exp | ETL, Informatica PC & BDM | Python | PL/SQL| PostgreSQL| Redshift| DW| Power BI | Big data | Hive| Hue| Kafka| Spark | Shell scripting| Ex-ZS| Ex-TCS| MSc Data Analytics

  • Education

    • University of Pune

      2011 - 2015
      Bachelor's of Engineering Computer Engineering
    • National College of Ireland

      2024 - 2025
      Master of Science - MS Data Analytics

      The core modules which I will be studying in the Master's degree are as mentioned below : ,• Data Mining and Machine Learning• Database and Analytics Programming• Statistics for Data Analysis• Modelling, Simulation and Optimization• Research in Computing• Business Intelligence and Analytics• Domain Applications of Predictive Analysis

  • Experience

    • ZS

      Jul 2015 - Jan 2019
      Service Delivery Analyst

      • Proficient in PL/SQL and Informatica development.• Built mappings with various transformations for the client 'Iroko' over a period of 2.5 years.• Designed and created mappings, workflows, and sessions using Informatica Power Center, utilizing mapping designer, workflow manager, and workflow monitor.• Possess a comprehensive understanding of data warehouse concepts and advanced SQL.• Analyzed sources and targets along with their key dependencies to ensure appropriate business-related information.• Designed and created mappings using various transformations including router, lookup, aggregator, union, sequence generator, sorter, and expression transformation.• Developed workflows and schedules using Informatica to load required dimension and fact tables.• Implemented efficient query writing techniques to handle data redundancy and executed DDLs and DMLs in DEV/SIT following client change requests.• Demonstrated proficiency in Linux shell scripting for creating batch jobs for monitoring purposes.• Conducted ETL development in SnapLogic, AWS Redshift, and RDS.• Performed ETL testing in DEV/SIT/PROD environments as per client requirements.• Managed a team of 4 people while working on the BMS Italy project, overseeing break fixes.• Orchestrated RDS creation for the BMS France project.• Documented business requirements, including technical and functional specifications containing data modeling information.• Oversaw the migration of Informatica code and parameter files to environments like DEV, UAT, exporting XMLs for deployment across various environments.• Engaged in offshore team discussions to analyze business requirements with the client. Show less

    • Tata Consultancy Services

      Jan 2021 - Jan 2024
      Systems Engineer (Big data developer)

      • Proficient in utilizing Informatica BDM (Big Data Management) version for end-to-end data integration and management tasks.• Engineered advanced data mappings with multi-layered logic and transformations, streamlining data processing workflows and improving pipeline efficiency by 30%.• Designed and deployed scalable cloud solutions on AWS, utilizing services like EC2, S3, Lambda, and RDS to enhance performance and reliability.• Applied subject matter expertise in Big Data and distributed systems, optimizing data pipelines for large-scale datasets and improving resource usage.• Automated data workflows, including ingestion, aggregation, and ETL processes, resulting in reduced manual effort and improved operational efficiency.• Hands-on experience in HiveQL for data manipulation, transformation, and analysis tasks.• Optimized query performance by 30% and reduced data retrieval time by 40% through the implementation of Hive partitioning, bucketing, and advanced optimization techniques for large-scale datasets.• Proficient in Hue's features for querying, exploring, and visualizing data stored in Hadoop Distributed File System (HDFS).• Successfully scaled existing pipelines to handle 10x larger datasets, improving processing time by 30%.• Skilled in leveraging Python's data analysis capabilities to extract actionable insights from large datasets.• Skilled in integrating Python scripts and libraries with various data sources, CI/CD pipelines including databases, APIs, and file formats such as CSV, JSON, and Excel.• Leveraged creativity and strategic thinking to design innovative data architectures and workflows, ensuring alignment with evolving business goals and emerging technologies.• Demonstrated exceptional analytical and problem-solving skills by identifying and resolving complex data engineering challenges, optimizing workflows, and ensuring seamless data integration across systems. Show less

    • Circle K

      Sept 2024 - now
      Customer Assistant
  • Licenses & Certifications