Rajat Maheshwari

Rajat maheshwari

bookmark on deepenrich
location of Rajat MaheshwariAligarh, Uttar Pradesh, India
Followers of Rajat Maheshwari489 followers
  • Timeline

  • About me

    Data Scientist @ Emplay Inc. | Langchain | GenAI | Data Science | SQL | Software Engineering | FastAPI | Docker | Pytest | RabbitMQ

  • Education

    • Lovely professional university

      2018 - 2022
      Bachelor of technology - btech computer science

      Activities and Societies: web development, chess club

    • Delhi public school aligarh

      2004 - 2017
      High school : non medical (science) 80.20%

      Senior Secondary Board of Education (CBSE)Along with Computer-Science as a subject

  • Experience

    • Lovely professional university

      Jan 2022 - Apr 2022
      Undergraduate research assistant

      Data Logging for a real-time flow of packet, registering and categorizing multiple features of a packetBuilding Multiple Data Visualization to get a better grasp of data Using Arima Model to predict further congestion or forthcoming coming violation in a network

    • Emplay inc.

      Apr 2022 - Sept 2022
      Data science intern

      During my internship at Emplay, I spearheaded the development of an AutoTagging application designed to enhance our customers' file management system. My project involved several key phases:1)Data Preparation: Leveraged Pandas to clean and preprocess training data, ensuring the dataset was optimized for model training.2)Data Segregation: Employed Scikit-learn (sklearn) to effectively segregate the data, enabling accurate and efficient machine learning processes.3)Model Training and Deployment: Utilized Google Cloud Platform's Vertex AI to train and deploy a robust tagging model. This model automatically generates tags for files based on their descriptions and titles, adhering to a pre-established tagging methodology within the system.Through this project, I honed my skills in data science, machine learning, and cloud-based AI solutions, contributing to the improvement of Emplay's service offerings. Show less

    • Emplay inc.

      Feb 2023 - now

      As a Data Scientist at Emplay, I have been dedicated to enhancing our data infrastructure and developing innovative cloud-native applications. My key responsibilities and achievements include:1)Event-Driven Ingestion Services: Worked closely on transforming ingestion services to be event-driven using RabbitMQ as a message broker. This approach improved the efficiency and scalability of our data processing workflows.2)Cloud Native Application Development: Leveraged the Google Cloud Platform (GCP) to develop cloud-native applications that serve as counterparts to our in-house developed apps. This ensured seamless integration and enhanced the overall performance and reliability of our systems.3)Data Infrastructure Optimization: Continuously optimized data ingestion and processing pipelines to ensure high availability, reliability, and scalability. This included the integration of advanced monitoring and alerting mechanisms to maintain robust data workflows.4)Cross-Functional Collaboration: Collaborated with various teams to align our data solutions with business needs and technical requirements. This included working with software engineers, data analysts, and product managers to deliver high-quality data products.5)Innovative Solutions Implementation: Introduced and implemented new technologies and methodologies to streamline data operations and improve overall system efficiency. This included adopting best practices for cloud computing, containerization, and microservices architecture.Through these efforts, I have significantly contributed to the modernization and optimization of Emplay's data infrastructure, enhancing our capability to deliver high-quality, data-driven solutions to our clients. Show less As an Associate Data Scientist at Emplay, I played a pivotal role in developing and optimizing various data-driven applications and services. My responsibilities and achievements included:1)Pipeline Development: Designed and implemented a robust data processing pipeline, ensuring seamless data flow and integration across multiple systems.2)Endpoint Creation: Utilized FastAPI to develop efficient and scalable endpoints, facilitating smooth data access and interaction.3)Quality Assurance: Introduced pytest into the development workflow to ensure comprehensive testing of services, enhancing reliability and performance.4)Docker Optimization: Addressed challenges related to the large size of Docker images by implementing SlimToolkit, a tool for minimizing Docker images, thereby improving deployment efficiency.5)Application Development: Contributed to multiple applications aimed at Retrieval-Augmented Generation (RAG) and the ingestion of customer files into Elasticsearch. These applications supported downstream services for semantic search and context-based inferencing, leveraging Large Language Models (LLMs) to generate customer-specific solutions.6)AI Safety and Monitoring: Collaborated with WhyLabs to integrate guardrails on LLM responses, ensuring that outputs were accurate, safe, and aligned with user expectations.7)Client Collaboration: Maintained and improved tagging systems for SAP, a Fortune 500 client, enhancing their learning platform and ensuring accurate and efficient data categorization.Through these efforts, I enhanced my expertise in data science, API development, containerization, machine learning, and AI safety, significantly contributing to Emplay's technological advancements and service quality. Show less

      • Data Scientist

        Apr 2024 - now
      • Associate Data Scientist

        Feb 2023 - Apr 2024
  • Licenses & Certifications