Muhammad Mudassir Raza

Muhammad Mudassir Raza

Data Engineer

Followers of Muhammad Mudassir Raza1000 followers
location of Muhammad Mudassir RazaKarāchi, Sindh, Pakistan

Connect with Muhammad Mudassir Raza to Send Message

Connect

Connect with Muhammad Mudassir Raza to Send Message

Connect
  • Timeline

  • About me

    Data Engineer | Web Scraping Expert | ETL Pipelines | Data Modeling | AWS & GCP Certified

  • Education

    • Federal Urdu University of Arts, Science & Technology, Karachi.

      2019 - 2022
      Bachelor's degree Computer Science
  • Experience

    • Xloop Digital Services (Pvt) Ltd

      Nov 2022 - Jan 2024
      Data Engineer

      ● Developed a real-time room occupancy detection system using FastAPI, Kafka, Pyspark, PostgreSQL, and Random Forest, achieving a 20% accuracy improvement.● Automated the entire microservices architecture with Docker, integrating Flask for real-time predictions and end-to-end system development.● Engineered a real-time crypto data analytics pipeline using Python, Beautiful Soup, Kafka, and AWS S3 for seamless data extraction and integration.● Improved data processing efficiency by 30% with Snowflake and Snowpipe, enabling faster insights and decision-making.● Led backend development of a Learning Management System (LMS) using Django, focusing on data modeling, API implementation, and advanced user authentication.● Reduced post-release issues by 25% through comprehensive testing, debugging, and active participation in Agile development. Show less

    • Karachi AI - Community of AI Practitioners

      Jun 2023 - Aug 2023
      Data Engineer

      ● Managed an ETL pipeline with Apache Airflow, Python, requests, Beautiful Soup and MySQL, performing data extraction, transformations, and exploratory data analysis (EDA) with Pandas.● Improved data accuracy by 20% and enabled real-time dashboard updates in Google Sheets for stakeholders.● Developed and deployed a Streamlit dashboard visualizing financial consumer complaints across states, extracting data from Google Sheets.● Increased stakeholder engagement by 30% through intuitive state-wise insights and actionable items.● Led a data migration project from OLTP to OLAP using MySQL procedures, establishing a star schema for dimensions and facts.● Implemented and validated Pyspark and SparkSQL SCD2 for dynamic data updates, ensuring migration accuracy through comprehensive testing.● Developed and deployed a Python Flask API for user data submission, integrated with MySQL and containerized with Docker.● Improved user experience by 25% through streamlined data submission and efficient containerized deployment. Show less

    • MarketLytics

      Jan 2024 - now
      Data Engineer

      ● Developed an ETL pipeline in which i extract data through API from wincher and dumb into Big Query and then through SQL do data modeling and connect to looker studio reports and automate pipeline using cloud function.● Developed an ETL pipeline utilizing Python requests for web scraping to extract data from a list of websites hosted on Google Cloud Storage.● Integrated authentication with Google Cloud Platform (GCP) services for seamless data extraction and transformation.● Orchestrated the entire process within a Cloud Function environment, automating the extraction and uploading of data to Bigquery in JSON format.● Extract Data from Google Analytics and Dump Data to Bigquery and perform analysis using SQL queries in Bigquery ● Making staging models at DBT of data in BigQuery . Through this data move to data mart timely and fastly by CICD pipeline. Show less

  • Licenses & Certifications