Luis Soares

Luis Soares

Data Analyst

Followers of Luis Soares2000 followers
location of Luis SoaresPorto Alegre, Rio Grande do Sul, Brasil

Connect with Luis Soares to Send Message

Connect

Connect with Luis Soares to Send Message

Connect
  • Timeline

  • About me

    I make high-quality data be quickly available | Data Engineer | AWS CCP | Scala | Airflow

  • Education

    • Universidade Federal do Rio Grande do Sul

      2017 - 2021
      Bachelor's degree Bioinformatics

      • Development and publication of MPSBase• Python Programming

    • IU International University of Applied Sciences

      2023 -
      Master of Science - MS Data Management

      • Research on alternative approaches for data processing on the AWS cloud, exploring innovative methods to optimize data handling, storage, and analysis workflows. This involved evaluating various ETL technologies to identify the most efficient and cost-effective solutions.

  • Experience

    • HCPA - Porto Alegre Teaching Hospital

      Jun 2017 - Mar 2021
      Data Analyst

      • I conceived and developed the first web databases for mucopolysaccharidoses, working as a full-stack developer, on a PHP-based application hugely based on a MySQL database, providing a comprehensive and accessible platform for researchers and medical professionals. This project led to a publication in a high-impact factor (4.2) scientific journal.• I conducted research in Machine Learning in R and Python, focusing on feature selection and classification of rare diseases. This involved leveraging algorithms and identifying key patterns and markers for accurate disease classification.• I created using R programming statistics pipelines for biological research. These pipelines streamlined data ingestion and standardization in order to be useful to automatically compare across species and NGS experiments. Show less

    • BlueMetrics

      Jul 2021 - Jul 2023
      Data Engineer

      • I made available several new data streams for analytics and reporting making the integration between Online Transactional Processing (PostgreSQL, SQLServer, MySQL) and Online Analytical Processing (Redshift) ETL architectures using AWS services.• I created dynamic and user-friendly dashboards on Sisense for a diverse range of business types, enabling customers to easily track and analyze key performance metrics specific to their needs, supporting clients in making value from analytical solutions. Show less

    • Lobby CRE

      Dec 2021 - Jul 2023
      Data Engineer

      • I designed and implemented a state-of-the-art SQL data lakehouse on Redshift from scratch, unlocking a host of advanced analytics solutions for the real estate industry.• I successfully developed innovative in-memory data processing solutions using DuckDB on an AWS ECS application, reducing the ingestion latency time 10 times fold.• I supported the CS team by providing comprehensive tech support, conducting in-depth data analysis, and closely monitoring AWS cloud infrastructure for prompt issue resolution. Show less

    • Optum

      Aug 2023 - Jul 2024
      Data Engineer

      • Modeled the data transformation process in order to standardize different EMR and EHR systems to the Optum platform, allowing the integration of several more services such as Epic Clarity and NAMM EHR.• Implemented databricks pipelines using Scala to execute batch transformations leveraging new business options and accelerating ETL processing on 10 times fold• Modeled queries on Snowflake for Data Analyst and Data Quality teams deliver high-quality data quickly

    • DoorDash

      Aug 2024 - Feb 2025
      Data Engineer

      ◦ Managed the integration of a new HR system after the acquisition of a company by the Doordash group.These efforts included ETL setup and testing, allowing the integration to be delivered in less than 2 months.◦ Created ETL pipelines for historical data backfilling using Airflow, distributing workloads to prevent peakresource consumption◦ Automated alternative HR data integration using Fivetran, significantly reducing ETL maintenance overhead and leveraging a secondary data source. Show less

  • Licenses & Certifications

    • Bronze Medal iGEM Giant Jamboree 2019

      IGEM Foundation
      Nov 2019
    • TOEFL ITP B2

      ETS
      Jun 2017
    • SQL (Advanced)

      HackerRank
      Jan 2023
      View certificate certificate
    • The Complete Hands-On Introduction to Apache Airflow

      Udemy
      Jul 2023
      View certificate certificate
    • Spark with PySpark Certification

      Udemy
      Jan 2023
      View certificate certificate
    • English Proficiency Certificate - 135/160

      Duolingo English Test
      Jan 2023
      View certificate certificate
    • Academy Accreditation - Databricks Fundamentals

      Databricks
      Jan 2024
      View certificate certificate
    • AWS Partner: Accreditation (Technical)

      AWS Training Online
      Mar 2022
    • Speexx German CEFR Level A1

      Speexx
      Jan 2023
      View certificate certificate
    • AWS PartnerCast - Data Integration with AWS Glue

      AWS Training Online
      Jan 2023
  • Honors & Awards

    • Awarded to Luis Soares
      1st Place on Math school olympiads 2016 Colégio Sinodal São Leopoldo out. de 2016
    • Awarded to Luis Soares
      1st Place Math school olympiads 2015 Colégio Sinodal São Leopoldo nov. de 2015