
Timeline
About me
Research Data Scientist and Statistician at Google
Education

University of southern california - marshall school of business
2018 - 2023Doctor of philosophy - phd data science and operations• Selected coursework: regression & generalized linear models, causal inference, econometrics, panel data modeling, statistical inference, deep learning theory & practice with convolutional neural networks, dynamic programming & reinforcement learning theory.• Awarded “Top Reviewer” for 2023 International Conference on Artificial Intelligence and Statistics (AISTATS).• Gave invited talk on a novel fairness proposal at the 2020 Copenhagen Workshop on Algorithmic Fairness… Show more • Selected coursework: regression & generalized linear models, causal inference, econometrics, panel data modeling, statistical inference, deep learning theory & practice with convolutional neural networks, dynamic programming & reinforcement learning theory.• Awarded “Top Reviewer” for 2023 International Conference on Artificial Intelligence and Statistics (AISTATS).• Gave invited talk on a novel fairness proposal at the 2020 Copenhagen Workshop on Algorithmic Fairness (https://ps.au.dk/en/cepdisc/events/event/artikel/workshop-on-algorithmic-fairness/). Show less

Washington university in st. louis
2006 - 2009Bachelor of arts (b.a.) physics 3.85 gpa• Graduated Phi Beta Kappa and with College Honors in Arts and Sciences. Dean's List every semester.• Recipient of National Merit Scholarship and William George Eliot Scholarship.

University of cape town
2008 - 2008Physics
Experience

Self-employed
Jan 2009 - Aug 2018TutorI tutored middle school through adult students (and all ages in between) in physics, math (including calculus and college-level statistics), chemistry, and test prep (SAT, ACT, SSAT, ISEE, etc).

No name (private venue on fairfax)
Dec 2014 - Dec 2017Front of house audio engineerClients include musicians (Stevie Wonder, Lady Gaga, John Mayer, Nas, Panic! At The Disco, Travis Barker, etc.), comedians (Dave Chappelle, Tig Notaro, Jerrod Carmichael, etc.), and companies (Adobe, Activision, Variety Magazine, United Talent Agency, Ketel One, etc.).Responsible for making great events happen by handling a wide variety of technical issues on a tight timeframe while still remaining calm and pleasant (to avoid unnecessary mistakes and to make life more pleasant for our clients, the musicians I'm helping!). I rigorously check for any possible sources of error to prevent hiccups, and deal with the inevitable mistakes and oversights that still come up as quickly and seamlessly as possible, ideally before the musicians or audience even notice.Duties include planning mix sessions and stage plots, guiding the band and management through sound check to make sure they're happy with their sound, mixing, recording our shows in Pro Tools, patching cables, and setting up microphones.Info on the venue: https://www.yelp.com/biz/no-name-club-los-angeles Show less

Live nation
Feb 2017 - May 2017Machine learning internWorking remotely, independently created and implemented a time series algorithm in R to predict future concert set lists of bands based on past set list data. A description of how it works is attached. You can also view the code and the summary of how it works at https://github.com/gregfaletto/setlistpredictor.

Ziprecruiter
Jul 2017 - Jan 2018Data analytics research intern• Using R, developed a model to predict whether a job seeker will apply to a given job listing to infer preferences for listed perks.• Accessed data via SQL. Chose observational study design and model form. Conceived outcome metric, constructed from raw data.• Final deliverable was approximately 3500 lines of R code and a written report.

University of southern california
Jan 2019 - May 2023Research assistant, graduate assistant lecturer• Full-time data science researcher and lecturer Jan. 2021-May 2023; part-time research assistant Jan. 2019-Dec. 2020.• Designed, coded, and tested novel methods for top venues (International Conference on Machine Learning, PNAS). (1) [ICML 2023] PRESTO estimates rare event probabilities, like probability of purchase after viewing an ad (github.com/gregfaletto/presto). (2) Fused extended two-way fixed effects is a panel data causal inference (difference-in-differences with staggered adoptions) ML method (arxiv.org/abs/2312.05985; under review at Journal of Econometrics). (3) Cluster stability selection is a feature selection method for clustered features (github.com/gregfaletto/cssr-project).• Created a novel recommendation system with a startup. Used matrix completion to estimate factors of an approximately low-rank matrix, and harnessed learned factors in a model estimating click probabilities, improving probability estimation by 5.7%.• Taught 100 students $375,000 worth of courses on analytics in Excel & JMP (SAS); making dashboards; communicating results. Show less

Google
May 2021 - Aug 2021Data scientist intern• Designed, programmed simulation studies in Python to quantify flaw in prior method for estimating A/B test treatment effects. Crafted specific solution from problem description. Created a new method for treatment effect estimation & coded in Python.• Coordinated with team, responding to and incorporating broad-strokes objectives, informal feedback, and formal code reviews.• Reduced bias and MSE of treatment effect estimates by over 99% in simulations, while controlling Type I error rate much better.• Submitted 4000 lines of documented, reviewed Python code to Google codebase implementing method and simulation studies. Show less

Videoamp
Mar 2023 - Apr 2025• Create methodologies using advanced but interpretable causal inference techniques (including double machine learning, instrumental variables, & difference-in-differences) to estimate lift of ad campaigns on observational & experimental data. Derive confidence intervals, conduct power analyses, & write code for methodologies at production scale (tens of millions of observations). • Inspect data quality using tools like exploratory analysis and feature importance metrics to ensure superior model performance.• Collaborate across teams to better understand data and clients’ priorities to direct my efforts.• Created tool to compare viewing metrics under actual ad schedule vs. alternative to demonstrate the value of targeting.• Produce & maintain documentation. Communicate complex data analysis results to stakeholders (presentations, writing).• Proactive identify problems and clearly communicate early to solve. Identify and communicate blockers.• Use tools including Python (scikit-learn, matplotlib, NumPy, pandas), SQL (Snowflake), GitHub. Show less
Intermediate Data Scientist
Jul 2023 - Apr 2025Part-Time Apprentice
Mar 2023 - Jun 2023

Google
Apr 2025 - nowData scientist research, payments
Licenses & Certifications
- View certificate

Statistical learning (completion with distinction)
Stanford universityDec 2016 - View certificate

Certificate of appreciation
The global association for research methods and data scienceJun 2020 - View certificate

Intro to python for data science
Datacamp - View certificate

Data visualization with ggplot2 (part 1)
Datacamp - View certificate

Introduction to r
Datacamp - View certificate

Intermediate r
Datacamp
Honors & Awards
- Awarded to Greg FalettoWinning Solution--2020 COVID-19 Computational Challenge RMDS Jun 2020 Awarded second place (tied) in the competition. Using daily neighborhood test results across Los Angeles, developed a GLM panel data model (Poisson autoregressive count model) to forecast new infections of COVID-19 a day ahead in each neighborhood in Los Angeles. Used infection predictions to generate daily forecasted risk scores for each neighborhood. Joint work with Mohammad Mehrabi. Full solution and writeup available at https://grmds.org/2020challenge.
- Awarded to Greg FalettoBest Model--Orange County R Users Group Hackathon 2019 Orange County R Users Group May 2019 Participated in team that won "Best Model" at the Orange County R Users Group Hackathon 2019. Over the course of two days, developed a model associating health outcomes in California counties with levels of water pollutants. More information at https://gregoryfaletto.com/2019/05/19/our-entry-in-the-ocrug-hackathon-2019/ and https://twitter.com/oc_rug/status/1130553620456300544.
Recommendations

Holly geary
Secretary/TreasurerWarrenton, Virginia, United States
John dickson
Eastern Regional Director at the Employment Security DepartmentSpokane, Washington, United States
Michaela hippert
MS2 at St. George’s University School of MedicineWorcester, Massachusetts, United States
Sencer üstünses
Sales DirectorIstanbul, Istanbul, Türkiye
Anna stanley-smith
VP of Quality at Isomerase Therapeutics Ltd.Greater Cambridge Area
Mansi bansal
Business Analyst | Gold medalist- MBA NITKKR |Chandigarh, India
Catherine chapman, bsn, rn, cdn
Clinical Educator at Fresenius Medical Care North AmericaBurlington, North Carolina, United States
Zack orr
Director Of Ticket Operations at The University of Tulsa AthleticsTulsa, Oklahoma, United States
Azeddine htioui
Directeur agence chez Dragon Media NetworkDragon Media Network
Jorge rodríguez, cipm
Product Development Coord. at BMW Group Financial ServicesAzcapotzalco, Mexico City, Mexico
John sillitti
Senior Customer Success Manager at CDK GlobalSomonauk, Illinois, United States
Nafees ahmed
Digital Marketing Enthusiast | SEO & SMM SpecialistKarāchi, Sindh, Pakistan
Moncius joseph
Fue a InfotepRepública Dominicana
Simeon todorov, acma, cgma
Senior Manager | Associate of CIMA | Lean Six Sigma Certified | Process Improvement expertBulgaria
Edward salazar
Sr Executive Sous Chef Bon AppetitColorado Springs, Colorado, United States
Martín margonari
Software EngineerArgentina
Vitalii shevchenko
Frontend DeveloperAmsterdam, Noord-Holland, Nederland
Brandon blackburn
Leadership, Learning, & DevelopmentGreater Columbus Area
Raquel i. monforte
PhD Candidate in Film and Health | GuionistaSpain
Emmett jordan
Principal & Managing Director at Jordan Consulting & Professional ServicesWashington DC-Baltimore Area
...