Sumiya Sulthana

Sumiya Sulthana

Hadoop Developer

Followers of Sumiya Sulthana2000 followers
location of Sumiya SulthanaDubai, United Arab Emirates

Connect with Sumiya Sulthana to Send Message

Connect

Connect with Sumiya Sulthana to Send Message

Connect
  • Timeline

  • About me

    Senior Data Engineer l Spark l Scala l Hive l Azure l AWS | ELK

  • Education

    • Anil Neerukonda Institute Of Technology & Sciences

      2010 - 2014
      Bachelor of Engineering (BE) Electrical and Electronics Engineering
  • Experience

    • Capgemini

      Nov 2014 - Jan 2018
      Hadoop Developer

      • Creation of technical metadata in Hive using the mapping documents.• Developed the code to generate the DDL queries to create the tables in Hive based on the technical metadata.• Loading the files into Hive across different schemas on daily partition and performance optimizations in Hive.• Data validation in Hive with the source schema generated.• Provided the solution for reconciliation reports to ensure that no data loss from application source to Hive after loading.• Familiar with Spark framework and hands on experience in Scala using Eclipse with scala IDE & SBT.• Performed various spark core transformations and actions on top of data.• Involved in unit testing, regression testing and prepared the test cases.• Worked on falcon and error analysis using Oozie & logs from the Job history server. Show less

    • RENAULT NISSAN MITSUBISHI ALLIANCE, INDIA

      Feb 2018 - Nov 2021
      Data Engineer

      • Implemented End-To-End Big Data applications from scratch in Hadoop ecosystem• Responsible for building scalable distributed data solutions using Spark.• Worked with various files format like CSV, JSON, ORC, AVRO and Parquet• Optimized spark jobs using various optimization techniques• Responsible for developing custom UDFs and UDAFs in Hive

    • Innova Solutions

      Sept 2021 - May 2022
      Senior Data Engineer

      • Worked on Healthcare product development in the AWS platform• Implemented end-to-end big data application (from Data Discovery to Reporting)• Integrated AWS S3, Glue, EMR, Lambda and Apache Druid to amplify data storage and analysis capabilities, yielding a significant 30% enhancement in data accessibility and processing efficiency.

    • Dubai Technologies

      May 2022 - now
      Senior Data Engineer

      • Lead deep technical engagements with partners, including Briefings, Proof of Concepts• Worked on Azure POCs - ADLS, Delta Lake, Databricks, Data Factory• Implemented Data pipeline to ingest data in DataLake• Collaboration with stakeholders to define data requirements and data models• Optimized data ingestion applications for higher performance

  • Licenses & Certifications

    • Prophecy Data Transformation Copilot for Data Engineering

      Udemy
      Jun 2024
      View certificate certificate
    • Hortonworks Certified Associate

      Hortonworks
      Apr 2017
      View certificate certificate
    • Microsoft Certified: Azure Developer Associate

      Microsoft
      Jan 2024