Amit Sheth

Amit Sheth

Software Applications Manager

Followers of Amit Sheth612 followers
location of Amit ShethRichmond, Virginia, United States

Connect with Amit Sheth to Send Message

Connect

Connect with Amit Sheth to Send Message

Connect
  • Timeline

  • About me

    Director, Software Engineering at Capital One

  • Education

    • Fr Conceicao Rodrigues College of Engineering

      1998 - 2001
      Bachelor of Engineering (B.E.) Electronics engineering
    • New Jersey Institute of Technology

      2001 - 2003
      Masters Electrical Engineering

      Major in Advance VLSI design

  • Experience

    • Configuration Management Inc.

      Feb 2004 - Apr 2014
      Software Applications Manager
    • CMI

      Feb 2004 - Apr 2014
      Software Applications Manager
    • Capital One

      Feb 2004 - now

      Leading multiple Software Engineering teams for Enterprise Observability which support internal applications built using Open Source technologies to improve Observability using advanced analytics and machine learning/Generative AI at scale in real time. These include:- ML Ops - designed and led engineering teams to built an internal tool to perform Machine learning based Anomaly detection and Fault localization across millions of Observability metrics in near real time using Open source products and unsupervised machine learning models. Also designed a product that builds application dependency/lineage by scraping traces and metadata across distributed platforms and systems. Both of these products are actively being used to generate insights at Capital One.- Building a solution leveraging Generative AI to assist with incident troubleshooting.- Identified opportunities to realize Cloud cost savings of $10M/year on an ongoing basis- Observability specification, standardization and governance product - came up with the concept and built a team of engineers supporting automation of default monitoring at Capital One- Enterprise Observability Visualization platform - leading at team which builds visualizations for Observability needs using Open Source (Angular/React) and vendor products- Also leading a tiger team of Sr. Software and SRE engineers to analyze recent high severity incidents and close gaps in Observability for those applications using existing or new solutions, and also apply these learnings across other software applications in the enterprise-Partner with Customers, Product and Sr. Leadership on strategy and roadmap. Show less Lead the Engineering tower of Technology Automation and Optimization group at Capital OneResponsibilities include engaging with internal customers along with Product and Data Science partners to strategize solutions.Started in this role as an Individual Contributor, created a product roadmap and attracted internal and external talent to build multiple teams from scratch for Software, Data and Platform Engineering to deliver these products.Individually prototyped a brand new product to automatically discover inter/intra application lineage. Demonstrated this prototype to leadership to get buy-in and lead a team of engineers to build this product.Successfully released a large scale machine learning platform that performs anomaly detection on millions of time-series in real time.Sustained high levels of employee retention by investing in continuous development and learning of our engineers to stay updated on latest technologies that give a cutting edge to our products.Nominated and graduated from Capital One's Technical Leadership Development Program. Technical Interview Council member - responsible for developing design questions for tech interviewing and performing data analysis on Interview quality. Show less Leading a team of software and data engineers in the Automation and Optimization group at Capital One.Designing and building the next generation of monitoring products using Machine Learning and Big Data to prevent or quickly detect incidents as they happen and also perform advanced root cause analysis in order to reduce customer impact.Leveraging Open source tools and technologies, internally developed products, capabilities and pipelines, public cloud services and external vendor/SaaS products as appropriate. Show less Responsible for Monitoring Governance, Infrastructure monitoring on-prem and Multi-Cloud environments at Capital One.Built/supported next generation cloud monitoring tools using Open source tools for Enterprise scale, performance, resiliency and reliability.Worked as individual contributer (I.C.) Sr. Manager in the Enterprise Monitoring team.Designed, architected and worked as Product Owner for the Monitoring Governance product at Capital One. Created a multi-year roadmap for this product and provided technical direction to a team of software and data engineers to build this product. This product is aimed at identifying gaps in monitoring and alerting across the enterprise using a a data-driven approach. This has been very well received internally and externally and has received our tech-excellence award.Also served as Product Owner for Splunk and Zabbix products primarily to improve the resiliency posture of these products and improve their stability.Also worked on SSH Key Governance project and wrote code that runs on our entire EC2 fleet. Show less  Successfully performed Proof Of Concept with Ab Initio on EC2 with different configurations of EC2 servers, storage (EBS) Architected all aspects of the design including specific configurations for Server type, Storage, VPC, Security groups, in-region and out-of-region resiliency and data replication and EC2 rehydration Built one of the first automation scripts to support bulk transfer of data to/from on premise servers and AWS S3/Glacier Storage using compression techniques and end-to-end encryption. This script was used to move several 100’s of Terabytes of archival data from Legacy Tape Archive (TSM) to Glacier StorageImplemented Ab Initio on Hadoop Based Infrastructure at Capital One. Primary technical Point-of-Contact for this project Directed a factory team of 13 Onsite/Offshore resources to upgrade over 10000+ scripts/graphs to work with new Ab Initio Co>Op software, OS version, SSH version and KSH version on top of Hadoop Infrastructure Designed, developed and implemented in-house replication tool for in-region failover (North/South in-Data Center) and out-of-region DR failover (across Data Centers) Built server backup solution in partnership with Enterprise Storage team using NFS snapshotting Reduced server footprint and saved over $300,000 in hardware costsPerformed upgrades of existing legacy Linux Infrastructure: Worked with Enterprise Storage/UNIX teams to upgrade legacy environment to support 3 Data Center failover providing in-region and out-of-region (DR) failover as per Federal Audit requirements Designed and implemented strategy to upgrade SSH version from Commercial to OpenSSH without any impact to 100’s of incoming/outgoing connections as well 100’s of users Built an upgrade checker-script to identify issues that need to be remediated while upgrading from Ab Initio 2.15 Co>Op to 3.1 Co>Op Show less

      • Director Software Engineering

        Sept 2022 - now
      • Director, Software Engineering

        Jul 2020 - Sept 2022
      • Lead Software Engineer

        Jan 2019 - Jul 2020
      • Lead Software Engineer

        Apr 2017 - Jan 2019
      • Master Platform Engineer

        Apr 2014 - Apr 2017
      • Software Configuration Management Consultant

        Feb 2004 - Apr 2014
  • Licenses & Certifications