Le Ju

Le Ju

Followers of Le Ju806 followers
location of Le JuGreater Boston

Connect with Le Ju to Send Message

Connect

Connect with Le Ju to Send Message

Connect
  • Timeline

  • About me

    Master's student in Data Science at NEU | Ex Data Scientist Intern @Uber | B.S. in Mathematics at RHIT

  • Education

    • Rose-Hulman Institute of Technology

      2018 - 2023
      Bachelor of Science - BS Mathematics

      Activities and Societies: Association for Women in Mathematics

    • Northeastern University

      2023 - 2025
      Master of Science - MS Data Science 3.8/4.0

      Activities and Societies: Khoury Data Science Hub

  • Experience

    • Rose-Hulman Institute of Technology

      Dec 2020 - Feb 2023

      • Provided English as a Second Language (ESL) support to students who are non-native speakers of English• Worked individually or in small groups with international students to help them edit academic writing, practice conversation, improve pronunciation, grasp complex readings, or build other English skills

      • Peer Tutor for ESL Program in the Center for Global Engagement

        Apr 2021 - Feb 2023
      • Teaching Assistant for the Math Department

        Dec 2020 - Feb 2023
    • ChangAn International Trust Co., Ltd.

      Feb 2021 - Jun 2021
      Data Science/Analytics Intern

      ● Predicted user churn by engineering features and applying Random Forest, Boosting, and SVM models with hyperparameter tuning, achieving 95% accuracy and 84% recall while identifying churned and active customers● Utilized Excel (PivotTables, automated reporting) to analyze financial statements, cutting reporting cycles by 25% and identifying key cash flow issues, while constructing and testing a data pipeline and fully automated, interactive Tableau dashboards for internal stakeholders, enabling effective performance tracing● Conducted and analyzed A/B testing experiments to evaluate improvements in asset management efficiency Show less

    • Uber

      Apr 2023 - Aug 2023
      Data Scientist Intern

      ● Designed and implemented ETL solutions for 200GB+ data, optimized 40+ tables, reconstructed the database to reduce average query runtime by 20%, and applied advanced data cleansing techniques to address missing values, outliers, and distorted data distributions in Python, improving accuracy by 30% and reducing manual labor by 90%● Conducted exploratory data analysis to assess the impact of factors such as mileage, time, demand, weather, and accessibility on trip pricing in online taxi systems, using Python data visualization tools (Matplotlib, Seaborn)● Gathered and cleaned user engagement data in SQL, built and tested machine learning models (Lasso, CART, GBDT, XGBoost) to forecast time-share order peaks by zip code, saving over 10 work hours per week for the travel pricing team● Collaborated with cross-functional operations teams to identify high-value users and urgent orders using clustering algorithms (K-means, DBSCAN, Agglomerative), driving an 8% increase in user retention rate● Developed a computer vision system using image segmentation and CNNs with PyTorch to evaluate in-car COVID precautions, improving safety compliance monitoring for Uber Show less

    • Northeastern University

      Jan 2024 - now
      Teaching Assistant

      Teaching Assistant for Khoury College of Computer Science (DS 4200 Information Visualization, DS 3000 Foundations of Data Science)

    • Lantheus

      Jul 2024 - Dec 2024
      Data Science Co-op

      ● Engineered highly optimized SQL queries in Snowflake to efficiently extract, transform, and aggregate multi-terabyte medical claims data, introducing advanced Python-based data processing techniques to automate manual Excel workflows, doubling team productivity and enabling more accurate, timely business insights● Designed and deployed dynamic Power BI dashboards, incorporating DAX for real-time KPIs and trend visualizations, facilitating data-driven decision-making for senior management● Implemented reuseable data cleansing and feature engineering pipelines in Python, ensuring the integrity of large-scale datasets (10M+ records), enabling the development of high-accuracy predictive models● Built and fine-tuned Machine Learning models (Tree-based, ANN, Time Series Forecasting) to predict market trends in prostate cancer diagnosis, identifying a projected 60% growth in the PSMA PET imaging market by the end of 2025● Developed competitive intelligence frameworks for the key product market by integrating sales, claims, epidemiological data, and predictive modeling, uncovering a $25M market opportunity and driving a 12% increase in regional revenue Show less

  • Licenses & Certifications

    • AWS Academy Graduate - AWS Academy Cloud Foundations

      Amazon Web Services (AWS)
      Oct 2024
      View certificate certificate
    • AWS Academy Graduate - AWS Academy Cloud Architecting

      Amazon Web Services (AWS)
      Nov 2024
      View certificate certificate