
Timeline
About me
Data Scientist @ Emplay Inc. | Langchain | GenAI | Data Science | SQL | Software Engineering | FastAPI | Docker | Pytest | RabbitMQ
Education

Lovely professional university
2018 - 2022Bachelor of technology - btech computer scienceActivities and Societies: web development, chess club

Delhi public school aligarh
2004 - 2017High school : non medical (science) 80.20%Senior Secondary Board of Education (CBSE)Along with Computer-Science as a subject
Experience

Lovely professional university
Jan 2022 - Apr 2022Undergraduate research assistantData Logging for a real-time flow of packet, registering and categorizing multiple features of a packetBuilding Multiple Data Visualization to get a better grasp of data Using Arima Model to predict further congestion or forthcoming coming violation in a network

Emplay inc.
Apr 2022 - Sept 2022Data science internDuring my internship at Emplay, I spearheaded the development of an AutoTagging application designed to enhance our customers' file management system. My project involved several key phases:1)Data Preparation: Leveraged Pandas to clean and preprocess training data, ensuring the dataset was optimized for model training.2)Data Segregation: Employed Scikit-learn (sklearn) to effectively segregate the data, enabling accurate and efficient machine learning processes.3)Model Training and Deployment: Utilized Google Cloud Platform's Vertex AI to train and deploy a robust tagging model. This model automatically generates tags for files based on their descriptions and titles, adhering to a pre-established tagging methodology within the system.Through this project, I honed my skills in data science, machine learning, and cloud-based AI solutions, contributing to the improvement of Emplay's service offerings. Show less

Emplay inc.
Feb 2023 - nowAs a Data Scientist at Emplay, I have been dedicated to enhancing our data infrastructure and developing innovative cloud-native applications. My key responsibilities and achievements include:1)Event-Driven Ingestion Services: Worked closely on transforming ingestion services to be event-driven using RabbitMQ as a message broker. This approach improved the efficiency and scalability of our data processing workflows.2)Cloud Native Application Development: Leveraged the Google Cloud Platform (GCP) to develop cloud-native applications that serve as counterparts to our in-house developed apps. This ensured seamless integration and enhanced the overall performance and reliability of our systems.3)Data Infrastructure Optimization: Continuously optimized data ingestion and processing pipelines to ensure high availability, reliability, and scalability. This included the integration of advanced monitoring and alerting mechanisms to maintain robust data workflows.4)Cross-Functional Collaboration: Collaborated with various teams to align our data solutions with business needs and technical requirements. This included working with software engineers, data analysts, and product managers to deliver high-quality data products.5)Innovative Solutions Implementation: Introduced and implemented new technologies and methodologies to streamline data operations and improve overall system efficiency. This included adopting best practices for cloud computing, containerization, and microservices architecture.Through these efforts, I have significantly contributed to the modernization and optimization of Emplay's data infrastructure, enhancing our capability to deliver high-quality, data-driven solutions to our clients. Show less As an Associate Data Scientist at Emplay, I played a pivotal role in developing and optimizing various data-driven applications and services. My responsibilities and achievements included:1)Pipeline Development: Designed and implemented a robust data processing pipeline, ensuring seamless data flow and integration across multiple systems.2)Endpoint Creation: Utilized FastAPI to develop efficient and scalable endpoints, facilitating smooth data access and interaction.3)Quality Assurance: Introduced pytest into the development workflow to ensure comprehensive testing of services, enhancing reliability and performance.4)Docker Optimization: Addressed challenges related to the large size of Docker images by implementing SlimToolkit, a tool for minimizing Docker images, thereby improving deployment efficiency.5)Application Development: Contributed to multiple applications aimed at Retrieval-Augmented Generation (RAG) and the ingestion of customer files into Elasticsearch. These applications supported downstream services for semantic search and context-based inferencing, leveraging Large Language Models (LLMs) to generate customer-specific solutions.6)AI Safety and Monitoring: Collaborated with WhyLabs to integrate guardrails on LLM responses, ensuring that outputs were accurate, safe, and aligned with user expectations.7)Client Collaboration: Maintained and improved tagging systems for SAP, a Fortune 500 client, enhancing their learning platform and ensuring accurate and efficient data categorization.Through these efforts, I enhanced my expertise in data science, API development, containerization, machine learning, and AI safety, significantly contributing to Emplay's technological advancements and service quality. Show less
Data Scientist
Apr 2024 - nowAssociate Data Scientist
Feb 2023 - Apr 2024
Licenses & Certifications
- View certificate

Pythonic style of programming: tips and tricks
Educative, inc.Apr 2023 - View certificate

Become a flask developer
Educative, inc.May 2023 - View certificate

Python for programmers
Educative, inc.Apr 2023 - View certificate

Associate data scientist in python
DatacampMay 2024 - View certificate

Python programmer track
DatacampMay 2022 - View certificate

Elasticsearch 8 and the elastic stack: in depth and hands on
UdemyMar 2023 - View certificate

Data analyst with python track
Datacamp 
Data structures and algorithm
GeeksforgeeksJun 2020- View certificate

Web developer bootcamp with flask and python
UdemyJun 2023
Languages
- enEnglish
- hiHindi
- frFrench
Recommendations

Shruthi a
Project Manager | Ex-MicrosoftHyderabad, Telangana, India
Riya watts
|| Assistant Manager at HDFC Bank || BANKER ||Fazilka, Punjab, India
Brett thom
Staff Engineer | Full Stack Development @ AnewHealthGreater Philadelphia
Bogdan păun, m.sc., mba, pmp
Manager, Project Management Department at Hidroelectrica SABucharest, Bucharest, Romania
Chaitali chaudhari
Senior Software Development Engineer @ Xperi Inc | Ex-Atos | PICT'19Pune, Maharashtra, India
Piyush khodke
Strategic Sourcing-Lectrix|Ex-Jawa Yezdi Motorcycles|E-Vehicles| Electrical EngineerBengaluru, Karnataka, India
Wahyu mustakim
Engineer | Cost control | Assessor CompetencyPekanbaru, Riau, Indonesia
Nate steele
Digital Marketing Specialist at XPELLunenburg, Massachusetts, United States
Md. mahe ul haque
Engineering Officer at Beximco Pharmaceuticals Ltd.Bangladesh
Marc pallarès olivares
Consultoría Estratégica en Gestión de Flotas | Senior BDR en Bridgestone Mobility SolutionsBarcelona, Catalonia, Spain
Ashley mills
Sr. Data Mining AnalystCoxs Creek, Kentucky, United States
Todd hagerich
Art Director | Industrial Designer | Graphic Designer | Marketing Designer | Packaging Designer | Il...Pittsburgh, Pennsylvania, United States_Johnson.webp)
Victoria (tori) johnson
MS-RN, Epic Application Analyst Senior at SSM HealthMaryland Heights, Missouri, United States
Seth baker
Field Service Rep at Turbine Engine Specialists Inc.Fort Worth, Texas, United States
Scott wolber
Senior Director Of Operations at LiquiTech: Legionella & Waterborne Pathogen PreventionGreater Chicago Area
Andrea anderson
Tenancy AdvocateWarrenup, Western Australia, Australia
Sydney schultz
Data Analyst & Integration Project Manager at A&O ShearmanUnited States
Mohammad elayyan
Head of IT, IT Application and technology Professional , Subject Matter Expert In Banking Applicatio...Manama, Capital Governorate, Bahrain
Gabriela dinu
Social Media ManagerBucharest, Romania
Adrian theopulos
UX Designer | Product DesignerAlbuquerque-Santa Fe Metropolitan Area
...