Profile Photo
Sarvesh Kharche

MS in Data Science

Rutgers University

location_on New Brunswick, NJ
About

I'm Sarvesh Kharche, an MS in Data Science student at Rutgers University (May 2025) with experience as an Analyst.

With a strong foundation in machine learning, statistical analysis, and big data. Experienced in designing and deploying scalable data pipelines, optimizing real-time systems, and developing AI-driven solutions – including generative AI applications. Adept at translating complex data into actionable insights to support product strategy and innovation.

Skills
code Programming
Python
SQL
R
data_usage Machine Learning & AI
scikit-learn
TensorFlow
PyTorch
Hugging Face Transformers
Langchain/Langgraph
Generative AI
psychology Data Engineering & Big Data
AWS
PySpark
Kafka
ETL/ELT
Data Pipelines
Cloud Computing
data_usage Data Visualization & Analysis
Pandas
NumPy
Matplotlib
Seaborn
Tableau
Streamlit
code Tools & Developer Environment
Git
Docker
Jupyter Notebook
VS Code
Projects
Smart Travel Planner AI Project
Smart Travel Planner AI
AI AWS Langgraph

This project is a production-ready smart travel planner that uses a multi-agent AI system powered by Amazon Bedrock to generate personalized, budget-aware travel itineraries for worldwide destinations from natural language queries. Live Application.

Live Application arrow_forward

Read more arrow_forward

Github arrow_forward

Flashcard Generation Project
Automated Flashcard Generation
NLP Transformers LoRA

Developed an NLP-powered tool that automatically generates high-quality flashcards from academic notes using a fine-tuned T5 model with Low-Rank Adaptation (LoRA).

Read more arrow_forward

Github arrow_forward

Autonomous Research Assistant For Literature Review
Autonomous Research Assistant For Literature Review
AI RAG Langchain

I developed an AI-powered research assistant that streamlines academic research by automatically gathering, summarizing, and synthesizing literature from sources like arXiv. This project leverages advanced techniques such as Retrieval Augmented Generation and interactive dashboards to enhance research workflows.

Read more arrow_forward

Github arrow_forward

Twitter Data Search App
Twitter Data Search Application
Azure SQL MongoDB Databricks

Designed an efficient system for collecting, storing, and retrieving Twitter data using Azure SQL and MongoDB with caching mechanisms to enhance query performance.

Read more arrow_forward

Github arrow_forward

Cricket Match Simulation
Cricket Match Simulation
Machine Learning Random Forest XGBoost

Simulates T20 cricket matches by predicting scores and wickets using IPL ball-by-ball data and player statistics, leveraging KNN clustering for strategic insights.

Read more arrow_forward

Github arrow_forward

Education
school Master of Science in Data Science

Rutgers University - New Brunswick

Sep. 2023 - May 2025

Relevant Coursework: Probability and Statistical Inference, Financial Data Mining, Database Management, Natural Language Processing, Data Structures and Algorithms, Statistical Modeling, Regression and Time Series Analysis, Data Analysis and Visualization.

school Bachelor of Engineering in Information Technology

University of Mumbai

Aug. 2017 - Jun. 2021

Experience
Cybersecurity Data Analyst

Tata Consultancy Services (Aug. 2021 - Jun. 2023)

  • Reduced incident response times by 40%, measured by faster security alert resolution, by implementing a real-time monitoring framework with advanced data processing, quantitative alert prioritization, and defined SIEM/SOC procedures.
  • Improved security posture & informed C-level strategy, leading to enhanced protocol adoption, by creating executive-ready visualizations of cybersecurity trends and vulnerabilities.
  • Cut security incidents by 30% & boosted threat detection by 20%, per incident logs & efficacy rates, by partnering with vendors and cross-functional teams to redefine detection metrics and protocols.
Data Analytics Intern

Eduvance (Jun. 2019 - Jul. 2019)

  • Applied Python (Pandas, NumPy) for ETL and preprocessing of large datasets, transforming raw data into analysis-ready formats during the data analytics bootcamp.
  • Built interactive Python dashboards (Matplotlib) to visualize data patterns and derive actionable insights within the data analytics bootcamp project.