Siddhesh Sheth

AWS Data Engineer Intern

1000 followers
Indianapolis, Indiana, United States

  • About me

    Actively Looking for Opportunities | Ex-Accenture | Data Engineering | Data Analysis | Python | SQL | AWS Certified Cloud Practitioner

  • Education

    • Indiana University Bloomington

      2022 - 2024
      Master's degree, Computer Science, 3.97/4.00
    • Savitribai Phule Pune University

      2017 - 2021
      Bachelor of Engineering - BE, Computer Science, 4.00/4.00
  • Experience

    • Jarvi Technologies

      Jul 2020 - Oct 2020
      AWS Data Engineer Intern

      • Designed a scalable AWS architecture with SageMaker, achieving 95% accuracy and reducing training time by 40% on 1 GB+ datasets.
      • Used AWS Lambda for automated preprocessing, handling 1M+ records per hour, reducing prep time by 50%.
      • Implemented Elastic Load Balancing and CloudWatch and optimized costs by 30%.

    • Accenture

      Aug 2021 - Aug 2022
      Data Engineer

      • Oversaw the migration, updates, and backups of AT&T Financial Billing Operations databases from SQL Server to Azure SQL Database through SSMS, with no data loss and a 15% reduction in recovery time.
      • Facilitated the migration of 100+ SQL Server databases using Azure DMS and DMA, refining schema mapping and data replication, which cut migration time by 20% and ensured seamless data transfer.
      • Supervised a team as the RSA Archer Admin: managed access roles, addressed 50+ weekly permission and security queries, and customized the system to meet GRC requirements, reducing GRC-related issues by 75%.
      • Monitored and auto-scaled Azure VMSS based on real-time usage, reducing resource wastage by 15%, maintaining 99.9% uptime, cutting infrastructure costs by 10%, and minimizing manual intervention.
      • Managed Astra alerts and implemented real-time threat detection, reducing security risks by 20%; weekly audits and vulnerability patching improved system resilience and cut GRC incidents by 30%.

    • ScotFin Consulting Co.

      May 2023 - Aug 2023
      Data Engineer

      • Designed and implemented an automated ETL pipeline in Python and SQL to extract, transform, and load tax filing data from multiple sources, such as IRS APIs and internal databases, processing 500,000+ records daily.
      • Developed a compliance monitoring module using machine learning models to flag discrepancies and audit risks in tax filings, achieving 92% accuracy and reducing manual audit time for faster resolution of compliance issues.
      • Built interactive dashboards in Power BI for real-time tracking of tax filing statuses, compliance alerts, and resolution outcomes.

    • Indiana University Bloomington

      Aug 2023 - May 2024

      • Led the design and indexing of a SQL database schema for transactions, orders, inventory, and departments, reducing data storage by 20% and improving query performance for sales analysis and inventory management.
      • Extracted inventory data from Excel files using openpyxl and transaction data from the SQL database with optimized SQL queries, streamlining data integration and reducing manual data handling by 60%.
      • Implemented advanced transformations, including data type conversion and normalization, with rigorous data wrangling and validation to ensure data integrity and accuracy for downstream reporting.
      • Used SQL for EDA and created interactive Power BI dashboards with DAX metrics to visualize sales trends, resulting in a 15% rise in yearly revenue and improved decision-making efficiency through real-time insights.

      • Data Analyst

        Aug 2023 - May 2024
      • Graduate Teaching Assistant

        Aug 2023 - Dec 2023
    • Hoosier Community Network

      May 2024 - now
      Data Analyst

      • Ingested CDC healthcare data into pandas DataFrames using Python and the sodapy library, which provides efficient access to the CDC’s Socrata open data RESTful API, enabling comprehensive analysis of healthcare metrics and trends.
      • Deployed a PostgreSQL database on Heroku, using SQLAlchemy as the ORM to load and manage large DataFrames, reducing query execution time by 30% and improving data processing speed for real-time analytics.
      • Created interactive dashboards by connecting the PostgreSQL database to Metabase, enhancing data exploration through SQL queries and allowing us to observe disease patterns and healthcare trends.

    • XAI

      Aug 2024 - now
      AI Tutor - Data

      • Labeled and curated data for GROK by using proprietary software to support model training and evaluation.
      • Collaborated with cross-functional technical teams to design and refine annotation tools, optimizing data labeling workflows and improving the efficiency of AI model development processes.
      • Conducted comprehensive evaluations of AI-generated content, providing detailed feedback to improve model performance.

  • Licenses & Certifications

    • Getting Started with AWS Machine Learning

      Coursera
      Apr 2020
    • IT Academy: Network Virtualization Concepts

      VMware
      May 2020
    • ICSI | CNSS Certified Network Security Specialist

      ICSI (International CyberSecurity Institute), UK
      May 2020
    • MTA: Introduction to Programming Using Python - Certified 2020

      Microsoft
      Jul 2020
    • IT Academy: Software Defined Storage Concepts

      VMware
      May 2020
    • Google Cloud Platform Fundamentals: Core Infrastructure

      Coursera
      Jul 2020
    • Introduction to Cybersecurity Tools & Cyber Attacks

      IBM
      May 2020
    • AWS Fundamentals: Going Cloud-Native

      Coursera
      Apr 2020
    • AWS Fundamentals Specialization

      Coursera
      Apr 2020
    • Face Detection App

      ETHNUS
      Apr 2020
  • Volunteer Experience

    • Robin

      Issued by Robin Hood Army in Aug 2022