Rakesh Singh

Software Engineer

3000 followers
New York City Metropolitan Area

  • Timeline

  • About me

    Data Engineer | Python | PySpark | Azure (Databricks, Data Factory) | Snowflake | Informatica Cloud, BDM, PowerCenter, IDQ

  • Education

    • University of Mumbai

      -
      Master of Computer Applications - MCA
  • Experience

    • Key Human Resources Private Limited

      Nov 2008 - Jun 2010
      Software Engineer

      - Responsible for data extraction from the Data Warehouse as per business requirements
      - Built ad hoc SQL queries to extract data from the Data Warehouse to run campaigns
      - Automated manual data extraction using stored procedures and scheduled it as per business requirements
      - Responsible for batch monitoring and support for the application

    • CMC LTD

      Jun 2010 - Dec 2010
      IT Engineer

      - Designed stored procedures, views, triggers and functions to implement complex business logic
      - Developed C programs and Unix scripts to execute database procedures and other database components
      - Involved in design, development and testing of the Derivatives Trading & Settlement application
      - Performed performance tuning of SQL queries and stored procedures
      - Responsible for application support

    • On Demand Agility

      Dec 2010 - May 2013
      Senior Software Engineer

      - Responsible for analyzing, designing and developing ETL strategies
      - Developed complex data transformation logic and Mapplets, and worked with different components of the Workflow Manager
      - Good exposure to creating and debugging ETL/IDQ mappings, SQL/T-SQL, PL/SQL, stored procedures and functions
      - Developed shell scripts to automate the execution and propagation of ETL and IDQ mappings across different environments
      - Created Autosys jobs to schedule IDQ mappings and PowerCenter workflows
      - Collaborated with system owners to identify data for quality assessment
      - Used the Informatica Analyst tool to run source column profiling to help data stewards identify gaps in source data while gathering requirements
      - Implemented data quality processes including transliteration, parsing, analysis and standardization techniques using reference tables and the Standardizer transformation, and enrichment at point of entry and in batch mode
      - Developed data quality mappings using IDQ to cleanse, de-duplicate and capture exceptions
      - Used the Informatica Developer tool to design and develop custom objects, rules, matching strategies and import/export mappings

    • Nomura

      May 2013 - Sept 2019

      - Worked on data ingestion and transformation of large sets of structured and unstructured data
      - Worked in Apache Spark for data transformation and built a three-way reconciliation report
      - Designed and created Hive external tables using a shared metastore
      - Developed Hive queries for analysis across different banners
      - Implemented Hive static partitioning, dynamic partitioning and bucketing for performance optimization
      - Wrote Hive jobs to parse logs and structure them in tabular format to facilitate effective querying of the log data
      - Developed and optimized Sqoop jobs to import and export data into HDFS and Hive
      - Used Oozie to schedule workflows that manage Apache Hadoop jobs

      - Worked as a technical lead for development and maintenance of the Data Warehouse and Data Governance platform using Informatica PowerCenter and Data Quality (IDQ)
      - Converted functional requirements into solution designs and technical specifications
      - Designed and maintained the database model using the Ralph Kimball methodology (star schema)
      - Designed, maintained and tuned ETL solutions using Informatica PowerCenter
      - Developed and scheduled Autosys batch jobs
      - Worked closely with business counterparts to agree on IT solutions and timelines for implementation
      - Responsible for capacity planning of Unix servers (cores, storage) and databases for current and future workloads
      - Responsible for designing housekeeping jobs to ensure proper system health checks
      - Responsible for providing stakeholders with project status and progress summaries
      - Responsible for installation, configuration management, version upgrades and HotFixes of Data Quality and PowerCenter on the Linux platform for versions 9.* and 10.*
      - Responsible for implementing security privileges in the application, including creating groups and roles and assigning privileges to each user group
      - Responsible for migrating Informatica mappings, sessions and workflows from Dev and QA to Prod environments

      • Data Engineer

        Apr 2018 - Sept 2019
      • ETL and Data Quality Consultant

        May 2013 - Mar 2018
    • Scotiabank

      Jan 2020 - Mar 2020
      Data Engineer
    • Tata Consultancy Services

      Apr 2020 - Dec 2021
      Assistant Consultant
    • Cognizant

      Dec 2021 - now
      Manager
  • Licenses & Certifications

    • Big Data Hadoop and Spark Developer

      Simplilearn
      Jul 2019
    • Tableau Consumer

      Tableau Software
      Apr 2020