Yan Yin

AI Researcher intern

2000 followers
Beijing, China

  • About me

    Data Scientist at ByteDance

  • Education

    • Renmin University of China

      2014 - 2018
      Bachelor's degree, Mathematics and Statistics, GPA 3.78/4.00

      Activities and Societies:
      • Minister of Human Resource Department, Statistical Research Association of RUC
      • Director of Marketing and Public Relations, Youth Volunteer Association of RUC
      ■ Deep Learning Collaborative Model on Implicit Feedback Datasets
      ■ Split Questionnaire Design for Long Questionnaires
      ■ Textual Analysis Towards Spring Festival in Microblog
      ■ Sentiment Analysis for Financial News Article
      ■ Geriatric Data Quality Assessment for China National Committee
      ■ Bike Demand Prediction for Bike Sharing System
      ■ Investigation of Land Rights Reform Process
      ■ Relationship between the Value of Children (VOC) and Desired Fertility

    • Cornell University

      2018 - 2019
      Master's degree, Statistics, GPA 4.0/4.0
  • Experience

    • Baidu, Inc.

      Aug 2017 - Oct 2017
      AI Researcher intern

      • Collected and tracked daily international Artificial Intelligence reports by building a web crawler.
      • Designed market research on drowsiness detection devices targeting truck drivers.

    • Schlumberger

      Oct 2017 - Mar 2018
      Data Scientist intern

      • Cleaned historical oilfield trajectory data, deduplicated customer names in the database, plotted 3D trajectories to explore patterns, and collaborated with product managers and engineers on feature engineering.
      • Built an interactive gas drilling map for the team to find patterns and prioritize their work.
      • Developed a gradient boosting model to classify types of drilling trajectories, achieving 98% accuracy. The model's insights fed into the inputs for building drilling trajectory design software.
      • Implemented and optimized a data pipeline in Python to perform ETL on 3.5 million records from the database.

    • Ten-X

      Sept 2019 - Jan 2020
      Associate data scientist

      • Analyzed a real estate website's pageview data using Random Forest to rank potential bidders by their inclination to bid on listed assets, achieving 92% accuracy on test data. Worked with the sales team to refine their strategy, target clients, and improve sales.
      • Developed and maintained an email recommender system that suggests relevant assets to users using logistic regression. Deployed the algorithm to production in collaboration with stakeholders.
      • Monitored click metrics of recommendations, explored new features, and conducted A/B testing to further test the recommender system's efficiency. Increased customer interaction by 3.5% after adding the new feature.
      • Retrained the Random Forest business entity deduplication model and maintained its scalability as the volume of business entity data increased. False negative rate decreased by 4%. Performed hierarchical clustering on the output of the Random Forest algorithm; the processed data became the primary resource of our database.

    • CrowdDoing

      Dec 2019 - May 2020
      Data Scientist (volunteer)

      Natural language processing

    • Walmart

      Apr 2020 - Nov 2021
      Data Scientist
    • ByteDance

      Nov 2021 - now
      Data Scientist
  • Honors & Awards

    • Received a 20000 research fund in the 2016 Undergraduate Innovative Test Program (top 5%)
    • Received the Academic Excellence Scholarship in three consecutive years (top 10%)
    • Won second prize in the National College Student Academic Works Competition (top 10%)