
Mikhail Pykhtin
Big Data Analyst-Engineer

About me
Engineering Manager / Team Lead – X5
Education

ITMO university
2016 - 2020
Bachelor's degree, Faculty of Infocommunication Technologies, Intelligent Systems in Humanities (Data Science), 4.4
Graduation thesis: Collecting and processing data to build a predictive model of subscriber churn for a mobile operator.
Experience

VEON
Aug 2019 - Oct 2020
Big Data Analyst-Engineer
Stack: Hadoop, HiveQL, SQL, Python, NiFi.
Responsibilities:
- organizing and facilitating meetings with business clients;
- collecting and writing business requirements;
- writing technical specifications for the development of data marts and ETL flows;
- analysis, research and creation of predictors (features) for machine learning models;
- building data marts (HiveQL) and launching ETL flows;
- testing ETL flows and validating data quality (data completeness, statistical analysis) on Hadoop;
- execution of ad-hoc requests using Python;
- geo-analytics of subscriber data, analysis of the locations with the greatest activity of cellular subscribers.
Projects:
1. B2B team - internal projects for the sales and marketing department, providing results to the customer (key and large business segments).
a) Improving the B2B customer churn model.
2. Center of Excellence team - external projects aimed at developing Big Data in the CIS countries:
a) Creation and implementation of a predictive churn model at Beeline Uzbekistan;
b) Creation and implementation of a credit scoring model for clients in Uzbekistan;
c) Geoanalytics - building a geolayer and calculating the coverage of base stations in order to identify the best places to install cellular base stations, build metro stations, place advertising, etc.

Sberbank
Oct 2020 - May 2024

Engineering Manager
Jun 2021 - May 2024
Responsibilities:
- management of a team of 6 Data Engineers (5 projects with 5 different customers);
- promotion and dismissal of employees, management of positions (staff, outsourcing) in the team;
- annual, quarterly and sprint planning of team tasks and resources;
- distribution, delegation and prioritization of tasks;
- control of task deadlines;
- project risk management;
- tracking development metrics (Lead Time, Time to Market, Velocity, development time, time for distractions) and developing measures to improve them;
- preparation of presentations on tasks, metrics and goals;
- coordination of contracts with counterparties;
- selection of employees and decision-making on candidates;
- conducting quarterly performance reviews of team members;
- conducting one-on-one sessions and feedback meetings with the team;
- motivation of team members.
Including, as Team Lead Data Engineer:
- development of data marts;
- implementation of integration interactions through file exchange;
- optimization of queries and calculations;
- code refactoring;
- design of solution architecture and integration interactions;
- team training;
- conducting code reviews;
- conducting technical interviews.
Achievements:
1. Built a team practically from scratch (growing it from 1 to 6 people).
2. Brought the team from the bottom of the ranking (10th place) to 1st place in the department in terms of development metrics, stability of deadlines and quality of improvements.
3. Reduced Lead Time by a factor of 2 by changing the team's accepted processes.
4. Increased team stability (2 years without layoffs) by solving problems of low motivation and mis-hiring.
5. Solved the problem of low-quality releases by replacing the tech stack, introducing new testing processes and developing a new training system within the team.
6. Introduced Code Review practice from scratch.
7. Introduced development standards (naming, code style) within the team.

Lead Big Data Engineer
Oct 2020 - May 2021
Stack: Spark (PySpark), Python, Hive, SQL, Hadoop, Airflow, Jenkins, Git.
Responsibilities:
- organizing and facilitating meetings with business clients;
- collection of business requirements;
- validation, comparison and selection of correct data sources;
- development of pipelines (ETL flows) on PySpark for building data marts;
- implementation of integration interactions through file exchange;
- optimization of queries and calculations;
- code refactoring;
- setting up ETL flow configurations;
- testing ETL flows and validating data quality (data completeness, statistical analysis) on Hadoop;
- coordination and validation of data with the business customer;
- execution of ad-hoc requests;
- writing documentation for bringing improvements into release;
- building the distribution using Jenkins (CI/CD);
- support for improvements in the production environment (3rd line of support).
Achievements:
1. Reduced Lead Time by a factor of 2.5 by using a new development framework.
2. Reduced the calculation time of the SberRating data mart by a factor of 3 through query optimization.

X5 Digital
May 2024 - now
Engineering Manager
Management of a team of 8 Data Engineers.
Stack: Apache Airflow, PySpark, S3, Postgres, Oracle, MQ, API, Kafka
Licenses & Certifications

Professional Certificate IBM Data Analyst
Coursera, Nov 2020

Effective Motivation Skills
Samolov Group, Apr 2022

Databases and SQL for Data Science
Coursera, Nov 2020

Apache Spark for Data Engineering
New Professions Lab, Apr 2021

Data Visualization & Dashboard Essentials
Coursera, Nov 2020

Effective Delegation Skills
Samolov Group, Sept 2021