Projects
Emotionally Intelligent Chatbot
Built a memory-augmented conversational AI that combines Cache- and Retrieval-Augmented Generation (CAG + RAG) to maintain context and respond with emotional intelligence. It fine-tunes lightweight LLMs (GPT-4o-mini, LLaMA 3.2 1B) and uses ChromaDB for fast, personalized memory retrieval, ensuring relevant and empathetic user interactions.
Umass Hangout
Developed a student collaboration platform using ReactJS, Spring Boot, MySQL, Elasticsearch, and Docker to streamline group creation, event scheduling, and activity management. Features include user authentication, profile management, real-time group discovery with Elasticsearch-powered search, and calendar invite integration.
Simulation of a Centralized Quantum Network for Multipartite Entanglement Distribution
rBuilt a Julia-based simulation modeling a star-topology quantum network in which a central node generates and distributes GHZ states to multiple end nodes, incorporating realistic noise effects (Werner states, dephasing) and probabilistic Bell state measurement failures. By running over one million trials, the project evaluates how link distance and the number of end nodes impact key metrics fidelity, latency, and success probability revealing that longer distances and more participants sharply increase latency and decrease both fidelity and success rates.
Design and Development of an Efficient Risk Prediction Model for Cervical Cancer (Published)
A Gradient Boosting–based model was developed to predict an individual’s risk of developing cervical cancer by incorporating a comprehensive set of lifestyle, clinical, and demographic factors (e.g., age, sexual history, HPV status, smoking). After preprocessing (including handling missing values and creating a combined “Cancer status” target variable from Dx:Cancer and Dx:CIN), feature importance was determined via XGBoost to select the most relevant predictors, and SMOTE oversampling plus stratified cross-validation were applied to address class imbalance and optimize performance resulting in a final model accuracy of 98.9% .
DistilBERT-BiLSTM for Multi-Category Web Content Classification (Published)
A hybrid model combining DistilBERT’s contextual embeddings with a Bidirectional LSTM (BiLSTM) layer to classify web pages into up to ten categories (e.g., Science & Technology, NSFW, Entertainment, News, Fitness). By leveraging transformer-based feature extraction and sequential modeling, it achieves precise identification of diverse content types including text, audio, video, and images thereby enhancing web filtering effectiveness for all age groups.
Enhanced Credit Card Fraud Detection with Tuned Isolation Forest Ensemble
This study combines K-Nearest Neighbors, Random Forest, Logistic Regression, and an Isolation Forest (iForest) within an ensemble framework automatically optimizing iForest hyperparameters via GridSearchCV to detect credit card fraud in real time. By comparing individual models to the tuned ensemble, it demonstrates that integrating multiple classifiers and fine-tuning iForest significantly improves detection accuracy and adaptability across varying transaction datasets.