I’m a data analyst with a Master’s degree in Information Technology and Analytics from Rochester Institute of Technology (GPA: 3.9). I specialize in transforming complex datasets into actionable insights through ETL automation, interactive dashboards, and machine learning models. My experience spans the transportation, telecom, and retail sectors, where I’ve applied tools like Python, SQL, Snowflake, Tableau, Power BI, Airflow, and AWS to solve business-critical challenges. I’m passionate about data storytelling, cloud analytics, and building intelligent systems that drive smarter, faster decisions.
Seasoned Data Analyst with 2+ years of experience driving business strategies through data-driven insights. Proven expertise in data science, statistical analysis, machine learning algorithms, and project management.
Structured skillset grouped by domain and proficiency, combining tools, platforms, and techniques across analytics, engineering, and cloud technologies.
Python
SQL
R
Java
C++
PySpark
Tableau
Power BI
Looker Studio
QuickSight
D3.js
Excel
Google Sheets
dbt
Apache Spark
Apache Kafka
Kubernetes
Docker
PostgreSQL
MongoDB
AWS
Google Cloud
Azure
Snowflake
Databricks
Amazon EC2
Amazon S3
AWS Lambda
Amazon RDS
Amazon Redshift
AWS Glue
Jupyter
TensorFlow
PyTorch
Scikit-learn
Keras
Kaggle
HTML5
CSS3
JavaScript
React
Node.js
Git
CI/CD
Rochester Institute of Technology (RIT) is a top-ranked U.S. university known for its strong focus on technology, innovation, and experiential learning. It offers leading programs in data science, computing, and analytics with a strong industry connection.
StandardWings Technologies Pvt. Ltd., founded in 2014 and based in Nashik, India, is a digital solutions provider specializing in web and mobile app development, IoT, AI/ML systems, and SAP services. They serve diverse industries, including government, manufacturing, BFSI, healthcare, and logistics, delivering innovative and cost-effective technology solutions. Their expertise spans frontend and backend development, CMS, cross-platform mobile apps, and enterprise mobility solutions.
Grade: 3.9/4.0
Relevant Coursework: Data Science & Analytics, Database Design, Non-Relational Databases, Data Warehousing, Visual Analytics, Information Retrieval & Text Mining, Time Series Forecasting, Knowledge Discovery
Grade: 9.27/10.00
Relevant Coursework: Data Structures & Algorithms, Design & Analysis of Algorithms, Theory of Computation, Database Management Systems, Operating Systems, Web Technologies, Cloud Computing, Software Engineering, Artificial Intelligence, Machine Learning, Data Analytics, Big Data & Hadoop, Data Mining, Applied Mathematics
Below are sample data analytics projects built with SQL, Python, Power BI, and ML.
HealthLens is a data-driven analytics project that uncovers operational inefficiencies in healthcare using Python-based EDA and interactive Power BI dashboards. By integrating multiple tables like billing, prescriptions, and diagnoses, it enables hospital administrators to track KPIs, spot anomalies, and make evidence-based decisions. Tools used include Pandas, Seaborn, Plotly, Power BI, and SQL.
CareAllocate models optimal distribution of limited healthcare resources (beds, staff, ventilators) across regions using linear programming and constraint optimization. Built with Python, Pandas, PuLP, and Streamlit, it helps policymakers or hospital admins make critical real-time decisions during pandemics or emergencies. Visual outputs support transparency and faster strategic planning.
ForecastPro is a robust MLOps pipeline that automates sales forecasting using Prophet and Scikit-learn. It includes CI/CD, model registry, and experiment tracking via MLflow and DVC. Designed for scalability, the system forecasts product demand and supports better inventory planning. The stack includes Python, FastAPI, Docker, GitHub Actions, and Streamlit.
AcuMedica is a Power BI–driven dashboard that visualizes key healthcare performance metrics such as patient outcomes, diagnosis frequency, and treatment trends. Using DAX, Power Query, and structured healthcare data, it provides stakeholders with actionable insights to improve clinical decision-making, resource planning, and operational efficiency in hospitals or clinics.
InfoSnip is a tailored news summarization tool designed to tackle the growing problem of information overload by transforming extensive news articles into concise and easily digestible summaries. Leveraging advanced Natural Language Processing (NLP) models, InfoSnip efficiently condenses long news pieces, making it easier for users to stay informed without spending excessive time reading full-length articles.
This project contains a machine learning-based solution for detecting dyslexia using behavioural and cognitive data. Dyslexia Detection leverages advanced machine learning models to identify dyslexic patterns in individuals, aiding in early diagnosis and intervention. The project demonstrates how data-driven approaches can enhance diagnostic processes, potentially providing more accurate and faster results than traditional methods.
Deep learning has revolutionized the analysis and interpretation of satellite and aerial imagery, addressing unique challenges such as vast image sizes and a wide array of object classes. This repository provides an exhaustive overview of deep learning techniques specifically tailored for satellite and aerial image processing. It covers a range of architectures, models, and algorithms suited for key tasks like classification, segmentation, and object detection.
This repository is dedicated to the project "Black Gold Horizon: Projecting America's Oil Future", a comprehensive analysis and forecast of crude oil production in the United States, both at the national level and for specific regions. The project utilizes advanced time-series analysis and forecasting models in R to provide predictions of future production levels. The goal of the project is to aid policymakers, industry professionals, and researchers in making data-driven decisions in the energy sector.
YouTrendify is a machine learning-based tool designed to provide predictive insights into YouTube video metrics, with a focus on subscriber growth, video ranking, and revenue estimation. By leveraging regression models, feature engineering, and advanced hyperparameter tuning, YouTrendify allows YouTube content creators and analysts to make data-driven decisions that can improve content strategy and performance.
This project, completed as part of the VAST Challenge 2016, focuses on analyzing operational data from the GASTech building. The data includes employee movements, HVAC sensor readings, and environmental parameters such as CO2 and Hazium gas levels. The goal of the project is to identify patterns, detect anomalies, and understand causal relationships between employee behavior and building conditions.
StyleSync is an AI-powered fashion compatibility system that analyzes user-uploaded clothing images and recommends visually and contextually compatible outfits. Built on the Maryland Polyvore dataset using a ResNet-50 and Multi-Layered Comparison Network (MCN) architecture, the system scores outfit compatibility and suggests matching items. It incorporates SerpAPI for real-time product recommendations with clickable links and includes a user feedback loop to personalize suggestions based on thumbs up/down ratings. The solution balances deep learning, API integration, and intuitive UI design to deliver a smart, personalized styling experience.
This project involves the creation of a visually informative dashboard to analyze Airbnb listings in New York City. The goal is to provide insights into rental prices, availability, and other key metrics across the boroughs of NYC using data visualization techniques. The dashboard enables users to interact with the data, making it easier to derive meaningful patterns and trends in the rental market.
Built a Formula 1 data engineering project using Spark on Azure Databricks with a Delta Lake architecture. A Formula 1 season runs once a year and comprises roughly 20 races, each held over a weekend. Roughly 10 teams (constructors) take part in a season, and each team fields two drivers. A qualifying session before each race determines the order in which drivers start. Each driver can make multiple pit stops to change tires or repair a damaged car. Driver standings and constructor standings are decided from the race results. The driver at the top of the driver standings becomes the drivers' champion, and the team at the top of the constructor standings becomes the constructors' champion.
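As a rough illustration of the standings logic described above, the sketch below totals points per driver and per constructor in plain Python. It is not the project's Spark code: the race results, the names, and the `standings` helper are all hypothetical, and ties are broken alphabetically for simplicity rather than by any official rule.

```python
from collections import defaultdict

# Hypothetical race results: (race, driver, constructor, points).
# These rows are made up for illustration and are not real data.
RESULTS = [
    ("Race 1", "Driver A", "Team X", 25),
    ("Race 1", "Driver B", "Team X", 18),
    ("Race 1", "Driver C", "Team Y", 15),
    ("Race 2", "Driver C", "Team Y", 25),
    ("Race 2", "Driver A", "Team X", 18),
    ("Race 2", "Driver D", "Team Y", 15),
]

DRIVER, CONSTRUCTOR, POINTS = 1, 2, 3  # column positions in each row


def standings(results, key_index):
    """Sum points by the chosen column and rank from highest to lowest."""
    totals = defaultdict(int)
    for row in results:
        totals[row[key_index]] += row[POINTS]
    # Sort by points descending; break ties by name (a simplification).
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


driver_standings = standings(RESULTS, DRIVER)
constructor_standings = standings(RESULTS, CONSTRUCTOR)

for name, points in driver_standings:
    print(f"{name}: {points}")
for name, points in constructor_standings:
    print(f"{name}: {points}")
```

The first entry of each list is the champion under this simplified scheme. In a Spark pipeline the same idea would typically be expressed as a group-by aggregation followed by an ordering step.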
This project focuses on analyzing customer churn within a telecommunications company using the Telco Customer Churn Dataset. The primary goal is to develop predictive models to assess customer churn and monthly charges. Several machine learning techniques, such as regression, classification, and clustering, were employed to extract insights and predict customer behavior.
Authored technical publications on data analytics and machine learning, focusing on forecasting models, performance analysis, and real-world applications.
Co-authored and published research at IEEE CONIT 2022 on human activity recognition using deep learning and computer vision. Built a 3D CNN with incremental learning to classify human actions (walking, jogging, running, boxing, waving, clapping) on the KTH dataset. Achieved 98.88% accuracy, outperforming prior methods while preventing catastrophic forgetting and enabling scalable real-time analysis.
This paper delves into the complexities of modern malware, which has advanced from simple, single-purpose software to sophisticated polymorphic variants, posing significant challenges to cybersecurity efforts. Traditional malware detection methods, reliant on signature-based classification, are increasingly ineffective against these evolving threats. Our research provides a comprehensive overview of contemporary malware detection strategies, emphasizing cutting-edge approaches such as artificial intelligence (AI), machine learning (ML) classification, deep learning, autoencoders, and IoT cloud environments.
Proposed a system combining transfer learning and incremental learning for human action recognition in videos. The model addresses challenges like catastrophic forgetting and retraining costs by learning new actions without losing prior knowledge. Evaluated on KTH and UCF101 datasets, it shows improved accuracy and efficiency over existing approaches.
Published research in American Journal of Electronics & Communication (2022) on predictive climate analytics. Analyzed 60+ years of global temperature and emissions data (NASA, UN) using regression models to forecast long-term temperature trends. Delivered insights on correlations between CO2 emissions, deforestation, and temperature anomalies through advanced data wrangling and statistical modeling.
Below are the details to reach out to me!