
Timeline
About me
Data Engineer | Hadoop, HDFS, Sqoop, Hive, Spark, SQL, Scala, AWS | Apache Spark, Spark-SQL | Expertise in Optimizing Spark Jobs & Cost Reduction | Actively Seeking New Opportunities
Education

Kalasalingam university
2017 - 2021Bachelor of technology computer science 8.6
Experience

Tata consultancy services
Jun 2021 - Feb 2023Assistant system engineer• Involved in loading data into HDFS from different Data sources like SQL Server, AWS S3 using Sqoop and load into Hive tables.• Involved in creating Hive tables, loading data from different data sources, HDFS locations and other hive tables.• Created SQOOP jobs and scheduled them to handle incremental loads from RDBMS into HDFS and applied Spark transformations.• Created Hive external tables to perform ETL on data that is generated on daily basis.• Developed Spark code in Scala and Python (Pyspark) and deployed it in AWS EMR.• Was responsible for Optimizing Spark SQL and HIVE queries that helped in saving Cost to project.• Worked in monitoring, managing, and troubleshooting Hadoop and Spark Log files.• Worked on Hadoop within Cloudera Data Platform and running services through Cloudera manager.• Involved in Agile methodologies, daily Scrum meetings, Sprint planning. Show less

Ipsos
Mar 2023 - Sept 2023Software engineer• Collaborated with data modeling teams, stakeholders, and data analysts to comprehend data requirements and translate them into technical specifications and structured data representations.• Developed Spark applications in Scala for performing data cleansing, event enrichment, data aggregation, and data preparation to meet business requirements.• Implemented data quality checks and validation processes to ensure accuracy, consistency, and completeness of data.• Worked on various data formats like AVRO, Sequence File, JSON, Parquet, and XML.• Worked on fine-tuning spark applications to improve overall processing time for pipelines.• Created Hive tables, loaded with data, and wrote Hive queries to process data. Created Partitions and used Bucketing on Hive tables and used required parameters to improve performance.• Debugged common issues with Spark RDDs and Data Frames, resolved production issues, and ensured seamless data processing in production environments.• As per business requirement stored spark processed data in HDFS/S3 with appropriate file formats.• Performed Import and Export of data into HDFS and Hive using Sqoop and managed data within environment.• Created EC2 instances and EMR clusters for Spark Code development and testing.• Performed step execution in EMR clusters for spark job deployment as per requirements.• Used Agile Scrum methodology/ Scrum Alliance for development. Show less

Societe generale
Apr 2024 - nowBig data engineer
Licenses & Certifications
- View certificate

Sap certified development associate - abap with sap netweaver 7.50
SapMar 2021 - View certificate

Python functions, files, and dictionaries
CourseraDec 2020 - View certificate

Python basics
CourseraNov 2020
Recommendations

Lakshmi narayanan
***Thoothukudi, Tamil Nadu, India
David orfega ikyaagba, r.engr, mnse, mnieee, iaeng
I am an electrical electronics Engineer with highly developed skills and interest in engineering con...Abuja, Federal Capital Territory, Nigeria
Kaline azevedo
Radialista l Diretora de Imagens l TV Cabo BrancoJoão Pessoa, Paraíba, Brazil
John crowder
SQF Practitioner, Aseptic Processing, Software Developer \ Data analysis (MSSQL), Maintenance Mana...Chatham, Ontario, Canada
Cheryle culler, pe, bcee
Utility and Railroad Engineer at Indiana Department of TransportationHicksville, Ohio, United States
Laura bobrova
Material Planner at AlstomAstana, Kazakhstan
Evan zamroni
Procurement section di Pt. YamindoSurabaya, East Java, Indonesia
Naim chlih
Master's degreeLongueuil, Quebec, Canada
정현기
현대모비스 ADAS 센서퓨전 SWGyeonggi, South Korea
Алексей кононов
Junior Java developerМосква, Москва, Россия
Elisa slabik-marx
Manager | Cloud Development at Deloitte ConsultingBerlin, Berlin, Germany
Fajar wahyudin
Customer Service and Operation Assistant Manager at Kintetsu World ExpressJakarta, Jakarta, Indonesia
Daniel de la hera lópez, frm
Amazon | INSEAD | Finance | ConsultingGreater Madrid Metropolitan Area
Suraj ramesh
CSPO® | SAFe|Product Owner | Travel Cruise Hospitality | ConsultantTrivandrum, Kerala, India
Joyce roxanne zacarias
Graphic DesignerGreater Montreal Metropolitan Area
Anthony e collins
Consultant and Independent DirectorIreland
Isabella deluca
Accenture Strategy & Consulting Associate Manager | One Journey Team Leader for NGO Partnerships & S...Washington DC-Baltimore Area
Ivaylo stoilov
Key Account Manager at Velia.net Internetdienste GmbHLozenets, Sofia City, Bulgaria
Sha'ira zuiverloon
B737 First Officer at Surinam AirwaysParamaribo, Suriname
Yepi susanti
Project Manager | Virtual Assistant to busy professionals and entrepreneursBatam, Riau Islands, Indonesia
...