Found 49 repositories(showing 30)
ANALYZING ROAD SAFETY & TRAFFIC DEMOGRAPHICS IN THE UK (Multi-class Classification) SUMMARY Here, I am aim to analyze the Road Safety and Traffic Demographics dataset (UK), containing accidents reported by the police between the years of 2004 - 2017. PROJECT GOALS: Identify factors responsible for most of the reported accidents. Build a machine learning model that is capable of accurately predicting the severity of an accident. Provide recommendations to the Department of Transport (UK Government), to improve road safety policies and prevent recurrences of severe accidents where possible. PACKAGES USED: Scikit-learn, numpy, pandas, imblearn (imbalanced-learn), seaborn, Matplotlib MOTIVATION World Health Organization (WHO) reported that more than 1.25 million people die each year while 50 million are injured as a result of road accidents worldwide. Road accidents are the 10th leading cause of death globally. On current trends, road traffic accidents are to become the 7th leading cause of death by 2030 making it a major public health concern. Between the years 2005 and 2016, there were roughly 2 million road accidents reported in the United Kingdom (UK) alone of which 16,000 were fatal. As a big data project, I wanted to explore the traffic demographics data in greater detail using machine learning! CONTEXT The UK government amassed traffic data from 2004 to 2017, recording over 2 million accidents in the process and making this one of the most comprehensive traffic data sets out there. It's a huge picture of a country undergoing change. Note that all the contained accident data comes from police reports, so this data does not include minor incidents. For steps undertaken to pre-process and clean the data, please view the "Data Cleansing & Descriptive Analysis_UK Traffic Demographics.ipynb" file DESCRIPTIVE ANALYTICS (EDA) Tools used include Python, Tableau, MS PowerBI Percent (%) distribution of target classes Percent dist of Accident Severity As seen above, the data is highly imbalanced. For detailed steps undertaken to deal with the imbalanced data, please view the "Modelling_Predictive Analytics_UK Traffic Demographics.ipynb" file. This article provides some great tips on utilizing the correct performance metrics when analyzing a models performance trained on an imbalanced dataset. This article describes several strategies that can help combat the case of a severly imbalanced dataset. Methods include: Resampling strategies (under - Tomek Links, Cluster Centroids, over sampling - SMOTE) Using Decision Tree based models Using Cost-Sensitive training (Penalize algorithms) Number of accidents by Year and Accident Severity Total accidents by year and severity It can be seen above that the trend seems to be increasing as the years go. In addition, the spike between 2008 - 2009 was because of a enhancement in the reporting system introduced in the UK in 2009, where all accident including minor accidents needed to be reported by the police so as to match the counts represented by hospitals, insurance claims etc. Accidents density by Location geomap Most accidents took place in major cities - Birmingham, London, leeds, Newcastle Accidents by Gender and Age Accidents by gender and age Accidents by Day of the week and Year Accidents by year and weekday Most accidents take place on a Friday Vehicle Manoever at time of accident Vehicle Manoever at time of accident Most accidents take place as a result of overtaking For more findings, please go to the "Images" folder. For steps undertaken to carry out some predictive modeling and hyper-parameter tuning, please view the "Modelling_Predictive Analytics_UK Traffic Demographics.ipynb" file. RECOMMENDATIONS TO THE DEPARTMENT OF TRANSPORT (UK) Decrease emergency response times during afternoon rush-hours (15-19) especially on Fridays. Allocate resources to investigate high density traffic points and identify new infrastructure needs to divert traffic from dual-carriage ways. Explore conditions of vehicles and casualties such as vehicle type, age of vehicles registered, pedestrian movements, etc. for policy makers. Adopt comprehensive distracted driving laws that increase penalties for drivers who commit traffic violations like aggressive overtaking. ACKNOWLEDGEMENTS The license for this dataset is the Open Givernment Licence used by all data on data.gov.uk. The raw datasets are available from the UK Department of Transport website. I had a lot of fun working on this dataset and learned a lot in the process. I plan to further my research in the area of predictive modeling using imabalanced data and how to effectively build a highly robust model for future projects. About Here, I analyze the Road Safety and Traffic Demographics dataset (UK), containing accidents reported by the police between the years of 2004 - 2017. Topics accident-rate accident-severity imbalanced-data imbalanced-learning road-accident reported-accidents road-safety uk-government transport traffic-demographics severe-accidents pca classification Resources Readme Releases No releases published Packages No packages published Languages Jupyter Notebook 100.0% © 2020 GitHub, Inc.
vighneshsawant112004-debug
"This project features two interactive dashboards analyzing insurance claims by state, time, gender, and vehicle age. They deliver actionable insights into fraud detection, claim severity, and customer behavior, supporting both operational efficiency and strategic decision-making.
damaniayesh
This project provides the dashboard of the financial performance of claims made against the policies using PowerBI.
shareef99997
No description available
chanchalmahor
No description available
No description available
ramya217
Self-service Power BI dashboard analysing 50K insurance claims with RLS
Shazmeengithub
No description available
NevilPanchal
Power BI dashboard analyzing insurance claims, severity & customer risk.
hyungishim
Power BI dashboard analysing insurance premiums, coverage, and claim trends
SomasreeMazumder
├── data/ │ └── insurance.csv ├── sql/ │ └── analysis_queries.sql ├── screenshots/ │ ├── page1_executive.png │ ├── page2_risk.png │ ├── page3_demographics.png │ └── page4_claims.png └── README.md
TsinjoNantosoa
No description available
venkateshmtn
This project analyzes insurance claims data to identify claim trends, high-risk policy categories, revenue contribution, and customer segmentation patterns. The dashboard provides actionable insights for improving risk management and operational efficiency.
sherpanuren
No description available
Sujal8860
No description available
mamadou-data
Power BI dashboard analyzing 8,000 insurance claims with star schema modeling and advanced DAX KPIs
Kaif-077
No description available
Shubh2428
Power BI dashboard analyzing insurance claims, fraud detection and customer risk profiling using DAX and interactive visualizations.
Interactive Power BI dashboard analyzing insurance data — premiums, claims, policy types, and customer trends. Built using Power Query, DAX, and SQL to deliver business insights and optimize decision-making for insurance operations.
End-to-end Power BI solution for insurance risk and claims analysis, featuring demographic insights, vehicle risk profiling, KPI tracking, and interactive visualizations.
SyedHuzaifa12
No description available
akshata684
No description available
immanuel-DataX
Data analytics project using Power BI to analyze insurance claims in the hospitality and travel sector, including KPIs and claim trend insights.
A repository of insurance claims done by car owners over a period of time
danishr198
Power BI Dashboard analysis 10000 claim records with Multiple Charts.
akshayxgh
No description available
trvaishnavi1995
Insurance Claims Fraud Analysis using Excel and Power BI with DAX, Star Schema Modeling, and Interactive Dashboard.
vbquotex4-sys
Powerbi dashboard for insurance claims and premium amount for different insurance products
A comprehensive Power BI dashboard designed to analyze and visualize car insurance claims data. This project transforms raw policy and claims data into actionable insights, enabling risk assessment, trend identification, and data-driven decision-making for an insurance company.
goregulab223
No description available