Solomon Kimunyu

Solomon Kimunyu

Data Scientist

Followers of Solomon Kimunyu8000 followers
location of Solomon KimunyuNairobi County, Kenya

Connect with Solomon Kimunyu to Send Message

Connect

Connect with Solomon Kimunyu to Send Message

Connect
  • Timeline

  • About me

    Senior Data Scientist | AI Engineer | Agentic AI | AI researcher | Data Engineer | Data analyst | Software engineer

  • Education

    • Udemy

      -
      Credit risk modeling
    • Kaggle

      -
      Computer vision and deep learning
    • Maranda High school

      -
    • Monketype

      -
      Touch typing Highest speed: 119 words per minute with 100% accuracy.
    • Coursera

      -
      Data science
    • Karatina University

      -
      Bachelor's degree in Actuarial Science

      Activities and Societies: Data science community Innovation Club, Karatina University Actuarial Students Association, Applied Statistics Association, Volunteers club,

  • Experience

    • Kaggle

      Jan 2018 - Mar 2020
      Data Scientist

      1. Text classification: Built a model using natural language processing (SpaCy, NLTK & Fastai) toidentify potential toxic comments on Wikipedia, contributing to a safer online environment.2. Real-time Starfish Object Detection: Developed an object detection model (YOLOv5) for real-time starfish identification, enhancing research or conservation efforts.3. Health Recommender System: Built a health recommender system leveraging various similarity measures (Person, Jaccard, Cosine) to personalize health recommendations.4. Chronic Disease Analysis and Scoring: Developed a system for chronic disease scoring and dataanalysis using Python libraries (Pandas, Matplotlib, Seaborn, Plotly).5. Investment Return Rate Forecasting: Utilized machine learning (Fastai & Keras) to create a modelpredicting investment returns, aiding in informed financial decisions. Show less

    • Zindi

      Mar 2020 - Dec 2021
      Data Scientist

      1. Social media analysis: Leveraging NLP to analyze sentiment in Swahili conversations for brand insights.2. Environmental conservation: Developing AI models for automated turtle recognition and marine invertebrate classification.3. Bridging the digital divide: Engineering Wolof speech recognition to empower non-literate individuals.4. Infrastructure development: Utilizing AI for road segment identification from satellite imagery.5. Financial inclusion: Predicting individuals most likely to benefit from banking services in East Africa.6. Retail optimization: Designing solutions for real-time product removal prediction, enhancing customer experience.7. Social good: Collaborating on building algorithms to classify tweets about gender-based violence and predict air quality for informed decision-making.8. Agriculture: Developing AI models for passion fruit disease classification and crop classification in South Africa using satellite data.9. Financial services: Predicting customer churn, solar customer payments, and vehicle insurance claims to optimize resource allocation and customer service. Show less

    • Predictive Analytics Lab

      Jan 2021 - Mar 2022
      Data science trainer

      1. Mentored 20+ data science students, fostering their development in building efficient machine learning models and achieving a 99% success rate on their projects.2. Organized a machine learning competition on Kaggle that required students to apply their data science skills to predict future sales prices using advanced regression techniques, fostering engagement and practical application of skills for data science students.3. Developed and delivered data science curriculum, equipping students with skills in essential areas like Python programming, data analysis with libraries like pandas and NumPy, machine learning with algorithms like decision trees and random forests, deep learning using frameworks like TensorFlow and PyTorch, computer vision with tools like OpenCV, and recommender systems using collaborative filtering techniques. Show less

    • AICE Africa

      Aug 2021 - Jun 2022
      Data Scientist

      1. Developed a customer chatbot (OpenAI GPT-3) providing accurate COVID-19 information, streamlining information access for users.2. Built an AI maturity assessment tool (Django, HTML/CSS/JS) for clients, enabling data-driven evaluation of their AI readiness.3. Analyzed financial data (Python, pandas, SQL) for clients, identifying trends and insights using visualization libraries (Seaborn, Plotly, Matplotlib).4. Cleaned and preprocessed financial data (pandas, scikit-learn) to prepare it for further analysis and modeling.5. Developed a credit scoring system for a fintech client (Django, Postman, MongoDB), facilitating efficient risk assessment and decision-making.6. Created and deployed a product recommendation engine (NetworkX, Streamlit), personalizing product suggestions for users and enhancing customer experience. Show less

    • Equity Bank Limited

      Jun 2022 - Mar 2024
      Data Scientist

      1. Built a customer churn prediction model using Python (XGBoost, LightGBM), increasing retention by 55%, boosting deposits by 20%, and improving loan uptake by 5%, leading to a 50% reduction in churn.2. Developed a customer acquisition model that identified 15M potential customers, increasing customer acquisition by 50% and deposits by 25%.3. Developed a fraud detection model using Python (CatBoost, XGBoost, Pandas), preventing 11K fraudulent transactions, significantly strengthening security measures.4. Built a credit monitoring dashboard in Power BI and SQL, cutting default rates by 50% through proactive risk management.5. Architected an OCR-based document processing system (OpenCV, TensorFlow, PyTorch), reducing KYC verification time by 40%, cutting manual errors by 80%, and decreasing fraudulent applications by 35%.6. Created a 360-degree customer insights dashboard using Power BI and SQL, cutting branch service time by 50% and improving customer experience.7. Built a customer reactivation model (XGBoost, Random Forest), reactivating 1M dormant customers and reducing dormancy rates by 12%.8. Created a rule-based customer ranking model, onboarding 20K new customers monthly, driving sustainable growth.9. Analyzed 1B+ transactions using Python and SQL, driving a 50% increase in card spending and a 20% rise in new card users, while reducing transaction failure rates from 5% to 0.45%.10. Created an ecosystem onboarding model, ensuring customer funds remained within the financial ecosystem by optimizing supplier and distributor engagement.11. Designed an insurance cross-sell model, increasing insurance uptake by 50% and identifying 42K health insurance and 100K vehicle insurance prospects.12. Developed a treasury analytics report, increasing monthly forex customers from 50K to 90K and boosting deal value by 50%. Show less

    • Equity Bank Limited

      May 2024 - now
      Senior Data Scientist

      1. Developed credit scoring models across 5 countries, scoring 25M customers by integrating internal and external data, increasing loan disbursements by 59%, and achieving a 98% repayment rate while eliminating reliance on third-party scoring services.2. Designed a loan recovery prioritization model leveraging Gradient Boosting, increasing monthly recoveries through targeted collection strategies such as automated loan recovery processes using Python and SMS notification systems, increasing repayment rates from 94% to 98%, and reducing written-off loans by 40%.3. Developed an AI model for credit scoring, resulting in a 50% growth in lending activity and annual cost savings amounting to millions of dollars.4. Created fraud detection models surpassing industry standards, achieving a false positive rate below 20% and a true positive rate exceeding 80%.5. Engineered customer acquisition models that identified 13 million new prospects.6. Devised cross-selling models spanning banking, investment, and insurance products, potentially doubling organizational product revenues.7. Designed multiple dashboards supporting rapid decision-making across Finance, Customer Experience, Credit, Operations, and Insurance domains.8. Developed and deployed chatbots powered by Large Language Models (LLMs) integrated with Neo4j graph databases, enhancing data retrieval and customer interaction.9. Implemented Retrieval-Augmented Generation (RAG) to dynamically fetch relevant information, boosting response accuracy and improving customer satisfaction by 40% while reducing response times by 60%.10. Built a knowledge graph with Neo4j to organize customer support data, enabling efficient automated query resolution through LLMs.11. Fine-tuned LLMs (LLAMA and Deepseek) using LoRA (Low-Rank Adaptation) to create domain-specific chatbots for banking and finance, reducing customer support tickets by 30% and increasing customer engagement by 25%. Show less

  • Licenses & Certifications

    • Spss

      Datactuary
      Jan 2018
    • Business and entrepreneurship

      Feb 2016
    • TensorFlow for AI: Computer Vision Basics

      Coursera
      Nov 2020
      View certificate certificate
    • Building Similarity Based Recommendation System

      Course
      Nov 2020
      View certificate certificate
    • Tensorflow for AI: Getting to know tensorflow

      Coursera
      Nov 2020
      View certificate certificate
    • Artificial Intelligence master class

      Udemy
      Jan 2020
      View certificate certificate
    • Movie Recommendation System using Collaborative Filtering

      Coursera
      Nov 2020
      View certificate certificate
    • ICT

      Digital Opportunity Trust
      Jan 2017
    • Spark and python for big data and pyspark

      Udemy
      Jan 2020
      View certificate certificate
    • Data Science Course 2020

      Udemy
      Jan 2020
      View certificate certificate
  • Honors & Awards

    • Awarded to Solomon Kimunyu
      Data scientist of the week - Jul 2021 Predictive Analytics Lab featured Solomon as the data scientist of the week on a blog article and LinkedIn
    • Awarded to Solomon Kimunyu
      Digital Insurance claim management system IBM Nov 2019 Solomon secured first place after presenting the Digital Insurance claim management system.Digital Insurance claim management systems can help the insurance industry speed up the claim process and reduce fraud associated with claims.
    • Awarded to Solomon Kimunyu
      Digital Insurance model Actuarial students association Jun 2019 Solomon presented the digital insurance model that demonstrated how artificial intelligence could improve customer experience and reduce fraud in the insurance industry.
    • Awarded to Solomon Kimunyu
      2nd place, Financial inclusion hackathon - Solomon was ranked 2nd in the financial inclusion hackathon. In this hackathon, Solomon created a machine learning model that accurately predicted that individuals are most likely to have or use a bank account in Africa.Solomon was awarded Nvidia coupons to learn Nvidia courses.
    • Awarded to Solomon Kimunyu
      Automatic speech recognition - Engineered an automatic speech recognition model on Wolof language spoken inSenegal using wav2vec3-xlsr that can help illiterate people use existing apps to findwhich bus they can take to reach their destination without having to know how to read orwrite.
    • Awarded to Solomon Kimunyu
      Autonomous shopping - Built a white-label autonomous shopping solution for Cape AI and Wellness Warehouseusing neural networks. This solution predicts whether a stream of 30 video frames from areal customer’s shopping session contains footage of a customer taking a product off theshelf. This solution provided an accurate, autonomous and a reliable autonomoussolution.
    • Awarded to Solomon Kimunyu
      Deep learning Indaba x hackathon winner My team won a hackathon organized by Deep Learning Indaba and Africa Law Tech
    • Awarded to Solomon Kimunyu
      Financial inclusion competition - Collaborated with a team of AI engineers from Malawi and Nigeria to build a machinelearning model that predicts which individuals are most likely to have or to use a bankaccount using catboost and xgboost (python libraries). This model provided an indicationof the state of financial inclusion in East Africa.
    • Awarded to Solomon Kimunyu
      Marine vertabrate classification - Created an automated image classification solution for photographs of marineinvertebrates taken by researchers in South Africa using fastai and resnet (pretrainedimage classification model). This substantially reduced manual image processing effortsand enabled researchers to detect changing patterns in marine invertebrates much fasterby reducing the need for human intervention.
    • Awarded to Solomon Kimunyu
      Road segmentation challenge - Designed a road segment identification algorithm using fastai and keras that identifieswhether a satellite image contains a road segment or not. This allowed governmentofficials to focus on areas they might need to send an official to confirm the roadplacement and add it to the government’s maps and road networks.
    • Awarded to Solomon Kimunyu
      Sea tutle conservation - Developed an artificial intelligence model that identified and distinguished betweenindividual sea turtles using keras and fastai.This automated the manual process ofindividual turtle recognition saving cost and time.
    • Awarded to Solomon Kimunyu
      Second runners up, GBV competition - Solomon was also ranked 3rd in the gender-based violence tweet classification challenge. In this challenge, Solomon created a machine learning algorithm that classifies tweets about gender-based violence into five categories: sexual violence, emotional violence, harmful traditional practices, physical violence, and economic violence.He was awarded Zindi points and achieved an overall rank of 52 out of over 30,000 data scientists on the platform.
    • Awarded to Solomon Kimunyu
      swahili social conversations challenge - Built a machine learning model that performs sentiment analysis on Swahili text datausing Bert and NLTK. This model helped in analyzing swahili social conversations anddetermined deeper context of swahili tweets as they apply to a topic brand or a theme.
  • Volunteer Experience

    • Instructor

      Issued by Power Learn Project
      Power Learn ProjectAssociated with Solomon Kimunyu
    • Founder

      Issued by Desolo Analytics Lab
      Desolo Analytics LabAssociated with Solomon Kimunyu
    • Advisor

      Issued by KARATINA UNIVERSITY ACTUARIAL STUDENTS ASSOCIATION
      KARATINA UNIVERSITY ACTUARIAL STUDENTS ASSOCIATIONAssociated with Solomon Kimunyu
    • President

      Issued by KARATINA UNIVERSITY ACTUARIAL STUDENTS ASSOCIATION
      KARATINA UNIVERSITY ACTUARIAL STUDENTS ASSOCIATIONAssociated with Solomon Kimunyu
    • Founder

      Issued by Data scientist community
      Data scientist communityAssociated with Solomon Kimunyu
    • Campus Representative

      Issued by The Actuarial Society of Kenya (TASK)
      The Actuarial Society of Kenya (TASK)Associated with Solomon Kimunyu
    • Vice President

      Issued by KARATINA UNIVERSITY ACTUARIAL STUDENTS ASSOCIATION
      KARATINA UNIVERSITY ACTUARIAL STUDENTS ASSOCIATIONAssociated with Solomon Kimunyu