-
Software Engineer @ Copart
Aug 2024 — ongoing | Dallas, TX
Technologies used:
-
Graduate Research Assistant
Jan 2024 — Aug 2024 | Richardson, TX
Developed a chatbot application for UTD-related queries using Retrieval-Augmented Generation (RAG),
with UTD documents and web page content stored in S3 and their vector representations managed in
OpenSearch for efficient query similarity search.
Deployed a fine-tuned Llama 3.1 model on AWS Bedrock, enabling accurate and context-aware responses
to user questions.
Implemented a FastAPI backend with LangChain to process chatbot queries,
scaling dynamically based on the traffic load using EKS.
Configured Elastic Load Balancer (ELB) to ensure efficient traffic distribution, high availability,
and optimal performance under varying request volumes.
Technologies used:
-
Software Dev Engineer 2 @ Swiggy
Oct 2022 — Jul 2023 | Bangalore, India
Designed PyTorch model serving capability within the Spring Boot codebase
using the Deep Java Library and Nvidia Triton for our in-house data science platform,
enabling PyTorch integration for production serving and reduced TensorFlow model memory usage by 30%.
Implemented robust platform cost monitoring mechanisms,
leading to a significant reduction in the Data Science team's compute usage by approximately 45%
Developed an observability tool using Databricks, Kafka, and Spark,
hosting and monitoring 80+ deployed data science models in real-time,
improving governance and reducing response times by 20%.
Technologies used:
-
Machine Learning Engineer 1 @ Swiggy
Dec 2021 — Sept 2022 | Bangalore, India
Expanded the reach of smart push notifications to 20 million users,
while drastically reducing Spark job runtime from 9 hours to 2 hours
and cutting compute costs by ~75% (from $688.74 to $166.84).
This resulted in a daily order increment of 117.6 on average and a 14% decrease in push notification uninstall rates
Improved the spell correction TensorFlow model inference time to under 100ms,
successfully tested at a load of 1000 requests per second
Developed a proof-of-concept (POC) for an insights dashboard
aimed at providing actionable competitor insights to restaurant partners
Technologies used:
-
Data Science Engineer @ Stylumia
Jun 2020 — Dec 2021 | Bangalore, India
Led development using SQS, Elasticsearch, HuggingFace,
and FastAPI, achieving 1200 RPS throughput and 30 ms P99,
which increased user engagement by 15%.
Automated the image-tagging process for customer-specific report generation,
cutting lead time by 50% and resulting in an 18% increase in customer conversion rate
Spearheaded the use of Jenkins for microservice deployment automation through CI/CD pipelines
and established a culture of thorough code reviews,
enhancing deployment speed and code quality.
Technologies used:
-
Research Intern @ Siemens
Jan 2020 — Jun 2020 | Bangalore, India
Engineered an attention-based algorithm to estimate weight and water content from potato images,
increasing estimation accuracy by 8% and reducing processing time by 30%.
Integrated the algorithm into an internal food quality assessment tool being prototyped at Siemens,
using Django and Docker for seamless operation
Technologies used:
-
Intern @ Flutura
Jun 2019 — Jul 2019 | Bangalore, India
Contributed to the development of the chatbot feature on Flutura's Cerebra platform.
The project used the Django REST framework, PostgreSQL,
and DialogFlow for the backend, and Angular for the frontend
Responsible for adding new features
and managing existing dialogue options in the backend part of the project
Technologies used: