Saeed Gavidel, PhD

811 followers

Plymouth, Michigan, United States

Connect with Saeed Gavidel, PhD to Send Message

Connect

Connect with Saeed Gavidel, PhD to Send Message

Connect

Timeline
About me
Sr. Lead AI-Data Scientist | Technical Leader | Solution Strategist & Architect | GenAI | Causal AI | Optimization | Electric Vehicle Analytics | eCommerce & Inventory Analytics | Data Quality Specialist
Education
- Wayne State University
  2015 - 2018
  Master's Computer Science- Big Data 3.96/4
  I worked (August 2015 to April 2018) under the supervision of Prof. Shiyong Lu the director of the Big Data Lab at Wayne State University. My Computer Science Master's Thesis titled as "STATISTICAL APPROACH TO PERFORMANCE COMPARISON OF PREDICTIVE ALGORITHMS: APPLICATION IN RESISTANCE SPOT WELDING" has been devoted to design and development of a new performance assessment framework to scientifically compare predictive performance of modern predictive models Deep Neural Nets (DNNs), Support… Show more I worked (August 2015 to April 2018) under the supervision of Prof. Shiyong Lu the director of the Big Data Lab at Wayne State University. My Computer Science Master's Thesis titled as "STATISTICAL APPROACH TO PERFORMANCE COMPARISON OF PREDICTIVE ALGORITHMS: APPLICATION IN RESISTANCE SPOT WELDING" has been devoted to design and development of a new performance assessment framework to scientifically compare predictive performance of modern predictive models Deep Neural Nets (DNNs), Support Vector Machines (SVMs), Random Forests (RF), and etc. to generate Weldability Prediction solutions under big and inconsistent data conditions. In this thesis, statistical techniques such as Monte Carlo Simulations, Bootstrapping, hypothesis testing techniques like paired-T-tests and Levene's tests, Goodness of Fit Tests have been extensively used to perform the research. Show less
- Wayne State University
  2014 - 2018
  Doctor of Philosophy - PhD Systems Engineering: Statistical Modeling and Data Analytics 3.98/4
  Activities and Societies: Institute of Industrial and Systems Engineers INFORMS My Ph.D. dissertation research project titled as "Systematic Data-Driven Client Prioritization/Triage in Service Industries: with Applications in Remanufacturing Services" is focused on the operational analysis and management of generic service systems where client sortation matters and affects profitability KPIs of the serving system. My special focus is on generating optimal/near optimal sortation solutions for service systems under risky and extremity conditions. • Dissertation Support… Show more My Ph.D. dissertation research project titled as "Systematic Data-Driven Client Prioritization/Triage in Service Industries: with Applications in Remanufacturing Services" is focused on the operational analysis and management of generic service systems where client sortation matters and affects profitability KPIs of the serving system. My special focus is on generating optimal/near optimal sortation solutions for service systems under risky and extremity conditions. • Dissertation Support Award, WSU, 2017.• Graduate and Post-Doc Research Symposium, Best Research Award, WSU, 2016 and 2017;• Industrial and Sytems Engineering Best Graduate Student Research Presentation Award, 2016; Show less
- University of Tabriz
  -
  Bachelor's degree Mechanical Engineering, Manufacturing and Production
- Amirkabir University of Technology - Tehran Polytechnic
  -
  Master's degree Mechanical Engineering, Manufacturing and Production
Experience
- Tabriz Oil Refining Co. (TZORC)
  May 2008 - Oct 2013
  • Certified Energy Management Expert (Japan International Cooperation Agency - JICA)• Deployment of Energy Management System (EMS) according to ISO-50001 at refinery level• “Benchmarking, Standard Setting and Energy Conservation Program for Oil Refineries” project member• Contributed to development of energy consumption model for refinery steam network • Monitoring and benchmarking energy performance of thermal and electrical energy systems• Controlled data quality provided by out-of-house partners and contractors• Trained, validated and tested a regression model to predict energy consumption at refinery level• Generated multiple regression models to predict energy consumption of furnaces, exchangers and steam turbines• Leading energy management projects: “Steam Trap Energy Management Program”• Extraction, Transformation, Loading (ETL) of energy data in large volumes • Conducted statistical simulation (Monte Carlo) to simulate energy consumption scenarios Show less Reliability Engineer & Data Analyst, Tabriz Oil Refining Company, Tabriz, 2009 to 2013• Extraction, Transformation, Loading (ETL) of equipment data in large volumes • Applied statistical predictive models like linear regression in maintenance engineering• Constructed statistical linear regression model for failure prediction/prevention of rotary machines• Generated predictive regression model based on vibration data to predict/prevent ball/roller bearing failure• Constructed predictive classification and regression models based on ultrasonic condition assessment data• Generated statistical pattern recognition model to infer failure patterns based on repair/maintenance history• Implemented predictive/preventive maintenance programs on safety-sensitive equipment like safety valves • Computerized Maintenance Management Systems (CMMS) development/deployment team member• Supervised and managed maintenance projects like Steam Network Maintenance Project • Constructed multivariate stochastic predictive model to assess repair costs of industrial valves repair operations Show less
  - Data Analyst & Energy Management Engineer
    Aug 2009 - Oct 2013
  - Predictive Maintenance Senior Engineer
    May 2008 - Oct 2013
- Wayne State University
  Sept 2014 - Dec 2018
  Graduate Research Assistant
  In this position, I pursued my PhD while serving as a Graduate Research Assistant. Specifically, I from September 2016 to May 2018, I was engaged in VRWP (Virtually guided Resistance spot Welding Project) that was a joint project where Wayne State University, Ford Motor Company, and Digital Manufacturing and Design Innovation Institute (DMDII) collaborated to construct weldability assessment solutions for Advanced High Strength Steels (AHSS) materials used in the automotive industry. Followings are some of my deliverables in this project:• Constructed a portfolio of predictive models including DNN-MLP, SVM, GAM, DNN, RF, KNN, CART, and M5P to predict the quality of vehicles RSW joint. I have developed these models using R. The developed models have been productionized at Ford Motor Company. The prediction accuracy of 97% achieved by DNN-MLP algorithm. As a result, 20% to 25% of Weld Engineers time saved during joint design stage.• Statistically simulated spot welding process and generated “weld lobe” heatmap, a key engineering tool to design a spot welding joint. I used parametric bootstrapping combined with Monte Carlo scenario generation to deliver this data product. This data product operationalized at Ford Motor Company.• Designed and developed a model selection framework based on Hypothesis Testing where both prediction accuracy and prediction precision are integrated into model selection process. In this framework, prediction accuracy of predictive models compared by using t-tests and prediction precision is compared by using Leven’s test. This work is published in International Journal of Advanced Manufacturing Technology.• Designed, developed, and operationalized the “Progressive Sampling Algorithm” to minimize the number of expensive experiments to collect data and train the predictive models. In this product, I employed Learning Curves to adjust the number of experiments. Show less
- Consumers Energy
  Jun 2018 - Jan 2019
  Data Scientist-CPT (Big Data Analytics)
  - Major Duty: Theft/Fraud Detection and Revenue Protection- Design, development, operationalization, and productionalization of regular and elastic machine learning algorithms and analytical pipelines (proved efficiency) to analyze and detect/predict theft/fraud events and protect company revenue.- Exposed to +3 Petabyte (+500 billion records) of data with high veracity - Employed Apache Spark, R, Python, and memSQL to construct the algorithms- Conducted ETL operations to extract data from large Data Lakes- Constructed multiple elastic Big Data Analytics algorithms to analyze consumption behavior of consumers- Discovered/detected and mitigated a technical defect in electric smart meters known as “18129 Defect”- Developed multiple Statistical Outlier Detection algorithms to detect electricity theft and protect CE’s revenue Show less
- Traxen
  Jan 2019 - Sept 2019
  Data Scientist-OPT (Data Science and Big Data Engineering)
  I have designed, developed, tested, and operationalized severalBig Data solutions for connected vehicle and smart mobility applications with special focus on conservation of energy.These Big Data solutions include scalable and elastic data pipelines to ingest, manage, and analyze sensory, operational, and consumer data received from multiple data sources. These solutions are being patented. Followings are high-level descriptions of my activities at Traxen:• Design, development, and implementation (by using Apache Spark, Python, and R) of a Big Data ingestion platform conforming with industry standards like SAE-J1339.• Construction of a composite Big Data Management System. In this composite system, traditional data management systems like MySQL and modern technologies like Apache Parquet and Avro have been employed to construct a composite Data Lake.• Design, development, and operationalization (by employing Python and PySpark) of an elastic Big Data platform for decoding, transformation, and management of CAN BUS system sensory data.• Design, development, and implementation of Normalized Driver Fuel Efficiency Performance Comparison Algorithm • Developing statistical tests according to industry requirements like SAE-J1321, and SAE-J1521 standards.• Constructing (using Spark MLlib) an elastic Random Forest predictive model to predict Fuel Efficiency.• Design, development, and operationalization of Big Data Stream analysis platform for the autonomous driving data.This framework has been constructed by using Spark Stream platform in PySpark environment.• Supporting development of a Reinforcement Learning algorithm for heavy vehicle Smart Mobility and Energy Conservation. In this task I provided Monte Carlo simulations of drive scenarios to pre-train an RL agent based on Experience Replay approach.• Mentoring junior Data Engineers, Developers, and Machine Learning Engineers. Show less
- Ford Motor Company
  Sept 2019 - now
  Data Scientist (HTHD) Team Lead
  As a High Tech High Demand Data Scientist and Big Data Engineer at Ford Motor Company GDI&A, SCA-V (September 2019 to present) GDI&A-SCAV Connected Vehicle (CV) Decode Data Quality Monitoring Team is responsible to ensure that high-quality CV, sensory, and customer data flows in the Ford’s analytical pipelines. I lead the team and we have successfully built +30 Big Data and AI/ML products including an intelligent Auto AI/ML system known as CV Data Quality Monitoring Framework (DQMF). The DQMF is a productionized Big Data quality assessment, prediction, monitoring, and reporting system equipped with 48 ETL and analytical engines. DQMF processes several hundred terabytes of Ford data and sinks the results into a real-time reporting dashboard. I have architected the DQMF. Spark, Scala, Python, Java,and Hive are for development and productionization purposes.• Architected, coded, and implemented the ETL engines on top of HDFS, designed and constructed the staging and permanent storage, built predictive and anomaly detection engines of CV-DQMF. The ETL engines are developed by using Scala/Spark and PySpark. The storage system has snowflake schema and constructed by using Hive. Python, R, Scala/Weka are employed to build the predictive and detection engines.• Designed, developed, and operationalized Volume Monitoring Engine that monitors the volume of streaming CV data into Hive tables. The engine intelligently learns dynamic thresholds to flag anomalous streams. I have used Random Forest, Ridge, LASSO, SVM, KNN, and CART algorithms to learn the thresholds. Random Grid Search is used to tune the hyperparameters and bootstrapping was used to estimate 95% CI. Python and its Scikit-learn framework is employed to develop this engine. Finally, matplotlib is utilized to visualize the output. In February 2021, the engine detected ~23 M CV records being archived in HDFS without being processed and ingestion in Hive tables. This is considered waste prevention at Ford. Show less
Licenses & Certifications
- Google Cloud Platform Fundamentals: Core Infrastructure
  Google Cloud - Minnesota
  Apr 2021
- Google Cloud Platform Big Data and Machine Learning Fundamentals
  Google Cloud - Minnesota
  Mar 2021
- Bets Poster Award
  Wayne State University
  Mar 2016
- Lean Six Sigma DMAIC Methodology Course- Green Belt Lavel
  Wayne State University
  May 2015
- Managing an Agile Team
  University of Virginia
  Mar 2021
- Communicating Business Analytics Results
  University of Colorado Boulder
  Jan 2021
- Leadership Communication for Maximum Impact: Storytelling
  Northwestern University
  Mar 2021
- Structuring Machine Learning Projects
  DeepLearning.AI
  Feb 2021
- Neural Networks and Deep Learning
  DeepLearning.AI
  Feb 2021
- Tools for Data Science
  IBM
  Sept 2020
- Big Data Modeling and Management Systems
  San Diego Supercomputer Center
  Jun 2020
- Data Science Ethics
  University of Michigan
  Jul 2020
- Convolutional Neural Networks
  DeepLearning.AI
  Mar 2021
- Big Data Emerging Technologies
  Yonsei University
  Jul 2020
- Google Cloud Platform Big Data and Machine Learning Fundamentals
  DeepLearning.AI
  Mar 2021
- Dataiku ML Practitioner
  Dataiku
  Jun 2023
  View certificate
- Dataiku Core Designer
  Dataiku
  Jun 2023
  View certificate
- Hypothesis-Driven Development
  University of Virginia
  Mar 2021
- Big Data Integration and Processing
  San Diego Supercomputer Center
  Jun 2020
- Agile Meets Design Thinking
  University of Virginia
  Mar 2021
- Communicating Business Analytics Results
  University of Colorado Boulder
  Jan 2020
- Introduction to Big Data
  San Diego Supercomputer Center
  Jul 2020
- Data Warehouse Concepts, Design, and Data Integration
  University of Colorado Denver
  Nov 2020
- Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
  DeepLearning.AI
  Feb 2021
- Agile Analytics
  University of Virginia
  Mar 2021
- Managing an Agile Team
  University of Virginia
  Mar 2021
- Academy Accreditation - Databricks Lakehouse Fundamentals
  Databricks
  May 2023
  View certificate
- Business Metrics for Data-Driven Companies
  Duke University
  Jul 2020
- Hypothesis-Driven Development
  University of Virginia
  Mar 2021
- Data Science Ethics
  University of Michigan
  Jul 2020

Saeed Gavidel, PhD

Timeline

About me

Education

Wayne State University

Wayne State University

University of Tabriz

Amirkabir University of Technology - Tehran Polytechnic

Experience

Tabriz Oil Refining Co. (TZORC)

Data Analyst & Energy Management Engineer

Predictive Maintenance Senior Engineer

Wayne State University

Consumers Energy

Traxen

Ford Motor Company

Licenses & Certifications

Google Cloud Platform Fundamentals: Core Infrastructure

Google Cloud Platform Big Data and Machine Learning Fundamentals

Bets Poster Award

Lean Six Sigma DMAIC Methodology Course- Green Belt Lavel

Managing an Agile Team

Communicating Business Analytics Results

Leadership Communication for Maximum Impact: Storytelling

Structuring Machine Learning Projects

Neural Networks and Deep Learning

Tools for Data Science

Big Data Modeling and Management Systems

Data Science Ethics

Convolutional Neural Networks

Big Data Emerging Technologies

Google Cloud Platform Big Data and Machine Learning Fundamentals

Dataiku ML Practitioner

Dataiku Core Designer

Hypothesis-Driven Development

Big Data Integration and Processing

Agile Meets Design Thinking

Communicating Business Analytics Results

Introduction to Big Data

Data Warehouse Concepts, Design, and Data Integration

Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization

Agile Analytics

Managing an Agile Team

Academy Accreditation - Databricks Lakehouse Fundamentals

Business Metrics for Data-Driven Companies

Hypothesis-Driven Development

Data Science Ethics

Languages

Recommendations

Jose aaron cervantes iturriaga

Catriel jubells

Maikel reis

Rahman shittu

Jay lee

Alexandre barbieri

Katerina heytens

Yunita rahmadina

Chris warford

Pavel jahoda

Eleonora m.

Emma steel

Giulio ulivieri

Claude lecomte

Tasheen parrish thomas

Ronewa masuvhelele

Sepideh mirzaee

Cyraelaine coliflores

Aleksandar milenic

Ankit varmani