Explore My Projects

Predictive Modeling for Diabetes Diagnosis
Featured Project

Python
Pandas
NumPy
Matplotlib
Seaborn
Sklearn

This project applies logistic regression to patient data to predict the presence of diabetes. Logistic regression was chosen because it is both robust and interpretable: each coefficient shows how a patient measurement shifts the predicted probability of a diagnosis. The aim is a reliable diagnostic aid that supports better healthcare outcomes.
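As a rough illustration of the approach, the sketch below fits a logistic regression classifier with scikit-learn. The project's dataset is not shown here, so synthetic data from `make_classification` stands in for the patient records; the sample size, feature count, and split ratio are assumptions made for the example only.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for patient records (NOT the project's data):
# 500 rows, 8 numeric measurements, one binary outcome.
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=5, random_state=0
)

# Hold out 30% of the rows to estimate out-of-sample performance.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Scale the features, then fit the logistic regression model.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
# Predicted probability of the positive class for each held-out row.
probabilities = model.predict_proba(X_test)[:, 1]
# One coefficient per (standardized) feature; the sign gives the direction of the effect.
coefficients = model.named_steps["logisticregression"].coef_[0]

print(f"Held-out accuracy: {accuracy:.2f}")
print("Coefficients:", coefficients.round(2))
```

Because the features are standardized first, the coefficient magnitudes can be compared with each other, which is part of what makes the model interpretable.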

Bike Rental Prediction Project
Featured Project

R Programming
tidyverse
skimr
lattice
car
broom
olsrr
lindia
lmtest
ggplot2
GGally
dplyr
magrittr

This project fits a multiple linear regression model to predict bike rentals. The response variable is the count of total rental bikes, and the predictors were selected using both logical reasoning and statistical methods.

Analysis of COVID-19 Data in New Zealand

This analysis uses statistical methods and data visualization, written in R Markdown and rendered to HTML, to explore New Zealand's COVID-19 data. It offers the data science community insights into the pandemic's impact and can help governments with crisis communication.
R Programming
ggplot2
maptools
tibble
tidyverse
ggrepel
png
grid
sp
coronavirus
magrittr
plotly
911 Calls Capstone Project

This capstone project analyzes 911 call data to surface emergency response patterns, geographic trends, and other factors that can help improve emergency services. The work involves visualizing the data, identifying patterns, and extracting meaningful information.
Python
NumPy
Pandas
Seaborn
Matplotlib
Sklearn
Statistical Analysis for a Research Project
Featured Project

R Programming

Played a pivotal role in the research project "Developing Companion Robot for Reducing Stress and Increasing Workability". Conducted in-depth statistical analysis of the research results using a range of data analysis techniques and tools, and provided data-driven conclusions that contributed significantly to the project's success.

Finance Project

This data project explores stock prices through comprehensive exploratory data analysis. By examining historical data, it aims to uncover patterns, trends, and correlations that are useful to investors and analysts. The statistical and graphical techniques used here support informed decision-making in the stock market.
Python
Pandas
Matplotlib
Seaborn
Plotly
E-commerce Optimization: A Data-Driven Approach for NYC Fashion Retailer

This project assesses the digital strategy of an NYC-based fashion retailer that must decide whether to focus on its mobile app or its website. Using linear regression modeling, it aims to streamline the customer journey after in-store style sessions, providing insights for strategic resource allocation and an improved e-commerce experience.
Python
Pandas
Matplotlib
Seaborn
Sklearn
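
One way to frame the app-versus-website question with linear regression is to compare the fitted coefficients of the two channels. The sketch below is illustrative only: the column names `time_on_app`, `time_on_website`, and `yearly_spend` are hypothetical, and the data is simulated with a stronger app effect built in, so it says nothing about the retailer's actual results.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n = 400

# Simulated customers (hypothetical columns, NOT the retailer's data).
customers = pd.DataFrame({
    "time_on_app": rng.normal(12, 1, n),       # minutes per session
    "time_on_website": rng.normal(37, 1, n),   # minutes per session
})
# Simulated spend: the app effect (40) is deliberately larger than the website effect (2).
customers["yearly_spend"] = (
    40 * customers["time_on_app"]
    + 2 * customers["time_on_website"]
    + rng.normal(0, 10, n)
)

features = ["time_on_app", "time_on_website"]
model = LinearRegression().fit(customers[features], customers["yearly_spend"])

# Each coefficient estimates the change in yearly spend per extra minute
# on that channel, holding the other channel fixed.
effects = pd.Series(model.coef_, index=features)
print(effects.round(2))
```

A larger coefficient for one channel suggests that time spent there is more strongly associated with spending, which is one input into the resource allocation decision rather than a complete answer.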
Neighborly Classifications: K-Nearest Neighbors for Predictive Modeling

This project uses K-Nearest Neighbors (KNN) to build a model that predicts the class of a new data point directly from its features. KNN assigns each new point the majority class among its closest labeled neighbors, which makes it an efficient and intuitive method for classification tasks.
Python
Pandas
Matplotlib
Seaborn
Sklearn
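
The neighbor-voting idea can be sketched with scikit-learn's `KNeighborsClassifier`. The data below is synthetic (two blobs from `make_blobs`), and the choice of `n_neighbors=5` is an assumption for the example rather than the value used in the project.

```python
from sklearn.datasets import make_blobs
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic two-class data (NOT the project's dataset).
X, y = make_blobs(n_samples=300, centers=2, cluster_std=1.5, random_state=0)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# KNN is distance-based, so features are standardized before fitting.
knn = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
knn.fit(X_train, y_train)

accuracy = knn.score(X_test, y_test)
# Classify one unseen point by majority vote among its 5 nearest training points.
predicted_class = knn.predict(X_test[:1])[0]

print(f"Held-out accuracy: {accuracy:.2f}")
print("Predicted class for the first test point:", predicted_class)
```

In practice the number of neighbors is usually tuned, for example by comparing error rates across several values of `n_neighbors`.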
Credit Clarity: Predicting Loan Repayment with Decision Trees and Random Forest

This project uses Decision Tree and Random Forest algorithms to classify borrowers by their likelihood of repaying a loan. Working from a cleaned dataset, the goal is a robust predictive model that helps investors identify individuals with a high probability of fulfilling their loan commitments.
Python
Pandas
Matplotlib
Seaborn
Sklearn
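
A minimal sketch of comparing the two model types on the same split is shown below. The data is synthetic and imbalanced by construction (an assumption standing in for the loan dataset), and the hyperparameters are illustrative defaults, not the project's settings.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for borrower records (NOT the project's data):
# roughly 80% of rows in class 0 and 20% in class 1.
X, y = make_classification(
    n_samples=1000, n_features=10, n_informative=5,
    weights=[0.8, 0.2], random_state=0,
)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

models = {
    "decision tree": DecisionTreeClassifier(random_state=0),
    "random forest": RandomForestClassifier(n_estimators=200, random_state=0),
}

scores = {}
for name, clf in models.items():
    clf.fit(X_train, y_train)
    scores[name] = clf.score(X_test, y_test)
    # Per-class precision and recall matter more than accuracy on imbalanced data.
    print(name)
    print(classification_report(y_test, clf.predict(X_test)))
```

A random forest averages many trees trained on resampled data, which typically reduces the variance of a single decision tree at the cost of some interpretability.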
SVM Iris Analysis: Unveiling Patterns in Botanical Data
Featured Project

Python
Pandas
Matplotlib
Seaborn
Sklearn

This project applies a Support Vector Machine (SVM) classifier to the well-known iris dataset. The analysis explores the patterns in the data and uses SVMs to draw conclusions about the underlying botanical measurements.
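
The iris dataset ships with scikit-learn, so the core of an SVM analysis can be sketched in a few lines. The kernel, `C`, `gamma`, and split ratio below are assumptions for illustration and may differ from the settings used in the project.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

iris = load_iris()

# Hold out 30% of the 150 flowers, keeping the three species balanced.
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.3,
    stratify=iris.target, random_state=0,
)

# RBF-kernel support vector classifier with scikit-learn's default-style settings.
svm = SVC(kernel="rbf", C=1.0, gamma="scale")
svm.fit(X_train, y_train)

predictions = svm.predict(X_test)
accuracy = svm.score(X_test, y_test)
# Rows are true species, columns are predicted species.
matrix = confusion_matrix(y_test, predictions)

print(f"Held-out accuracy: {accuracy:.2f}")
print(matrix)
```

If the default settings underperform, `C` and `gamma` are commonly tuned with a cross-validated search such as `GridSearchCV`.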

Unsupervised University Classification: K-Means Clustering Project

This project applies K-Means clustering to group universities into two clusters, Private and Public. The available labels are used in an unconventional way: not for training, but only afterwards to produce a classification report and confusion matrix. These metrics have limited relevance in a typical K-Means scenario, where no labels exist.
Python
Pandas
Matplotlib
Seaborn
Sklearn
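
The idea of clustering without labels and then checking the clusters against known labels can be sketched as follows. The data is synthetic (two well-separated blobs standing in for the two university types), and the matching step is an assumption added because K-Means cluster ids are arbitrary.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import confusion_matrix

# Synthetic two-group data (NOT the university dataset).
X, true_labels = make_blobs(
    n_samples=200, centers=[[0, 0], [6, 6]], cluster_std=1.0, random_state=0
)

# Fit K-Means on the features only; the labels are never shown to the model.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
cluster_ids = kmeans.fit_predict(X)

# Rows are true groups, columns are cluster ids.
matrix = confusion_matrix(true_labels, cluster_ids)

# Cluster ids are arbitrary (cluster 0 may correspond to true group 1),
# so take the better of the two possible id-to-label matchings.
agreement = max(matrix.trace(), matrix[0, 1] + matrix[1, 0]) / matrix.sum()

print(matrix)
print(f"Agreement with true labels: {agreement:.2f}")
```

Without the matching step, a perfectly good clustering could appear to score near zero simply because the cluster ids are swapped relative to the labels.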
NLP Sentiment Analysis: Yelp Review Classification

This project uses Natural Language Processing to classify Yelp reviews as either 1-star or 5-star. Each observation in the dataset reflects one user's sentiment toward a business. The "stars" column rates the business, while "cool," "useful," and "funny" quantify engagement. The project simplifies classification by combining the NLP steps in a pipeline.
Python
Pandas
Matplotlib
Seaborn
Sklearn
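
A text classification pipeline of this kind can be sketched with scikit-learn's `Pipeline`. The tiny corpus below is invented for the example, and the specific steps (bag-of-words counts, TF-IDF weighting, multinomial Naive Bayes) are an assumption about a typical setup rather than a record of the project's exact pipeline.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Invented toy reviews (NOT Yelp data), labeled 1-star or 5-star.
reviews = [
    "Great food and friendly staff, will come back",
    "Amazing service, the best pizza in town",
    "Loved the atmosphere, great coffee",
    "Terrible service and cold food",
    "Worst experience ever, rude staff",
    "Awful, overpriced and dirty",
]
stars = [5, 5, 5, 1, 1, 1]

# The pipeline chains text vectorization, TF-IDF weighting, and the
# classifier, so raw strings can be passed straight to fit and predict.
pipeline = Pipeline([
    ("bow", CountVectorizer()),
    ("tfidf", TfidfTransformer()),
    ("classifier", MultinomialNB()),
])
pipeline.fit(reviews, stars)

prediction = pipeline.predict(["The food was great and the staff were friendly"])[0]
print("Predicted stars:", prediction)
```

Bundling the steps this way keeps the vocabulary and weighting learned on the training text and applies them unchanged to new reviews, which avoids a common source of leakage and mismatched features.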