Search Results

Found 192 repositories(showing 30)

HeartHealthPrediction

ammarmahmood1999

🧡66

The major reason for the death in worldwide is the heart disease in high and low developed countries. The data scientist uses distinctive machine learning techniques for modeling health diseases by using authentic dataset eﬃciently and accurately. The medical analysts are needy for the models or systems to predict the disease in patients before the strike. High cholesterol, unhealthy diet, harmful use of alcohol, high sugar levels, high blood pressure, and smoking are the main symptoms of chances of the heart attack in humans. Data Science is an advanced and enhanced method for the analysis and encapsulation of useful information. The attributes and variable in the dataset discover an unknown and future state of the model using prediction in machine learning. Chest pain, blood pressure, cholesterol, blood sugar, family history of heart disease, obesity, and physical inactivity are the chances that inﬂuence the possibility of heart diseases. This project emphasizes to evaluate diﬀerent algorithms for the diagnosis of heart disease with better accuracies by using the patient’s data set because predictions and descriptions are fundamental objectives of machine learning. Each procedure has unique perspective for the modeling objectives. Algorithms have been implemented for the prediction of heart disease with our Heart patient data set

108

Jupyter Notebook

Updated 1 day ago

data-sciencedecision-treeshealthcare+5

neurons

Aryia-Behroziuan

🧡65

An ANN is a model based on a collection of connected units or nodes called "artificial neurons", which loosely model the neurons in a biological brain. Each connection, like the synapses in a biological brain, can transmit information, a "signal", from one artificial neuron to another. An artificial neuron that receives a signal can process it and then signal additional artificial neurons connected to it. In common ANN implementations, the signal at a connection between artificial neurons is a real number, and the output of each artificial neuron is computed by some non-linear function of the sum of its inputs. The connections between artificial neurons are called "edges". Artificial neurons and edges typically have a weight that adjusts as learning proceeds. The weight increases or decreases the strength of the signal at a connection. Artificial neurons may have a threshold such that the signal is only sent if the aggregate signal crosses that threshold. Typically, artificial neurons are aggregated into layers. Different layers may perform different kinds of transformations on their inputs. Signals travel from the first layer (the input layer) to the last layer (the output layer), possibly after traversing the layers multiple times. The original goal of the ANN approach was to solve problems in the same way that a human brain would. However, over time, attention moved to performing specific tasks, leading to deviations from biology. Artificial neural networks have been used on a variety of tasks, including computer vision, speech recognition, machine translation, social network filtering, playing board and video games and medical diagnosis. Deep learning consists of multiple hidden layers in an artificial neural network. This approach tries to model the way the human brain processes light and sound into vision and hearing. Some successful applications of deep learning are computer vision and speech recognition.[68] Decision trees Main article: Decision tree learning Decision tree learning uses a decision tree as a predictive model to go from observations about an item (represented in the branches) to conclusions about the item's target value (represented in the leaves). It is one of the predictive modeling approaches used in statistics, data mining, and machine learning. Tree models where the target variable can take a discrete set of values are called classification trees; in these tree structures, leaves represent class labels and branches represent conjunctions of features that lead to those class labels. Decision trees where the target variable can take continuous values (typically real numbers) are called regression trees. In decision analysis, a decision tree can be used to visually and explicitly represent decisions and decision making. In data mining, a decision tree describes data, but the resulting classification tree can be an input for decision making. Support vector machines Main article: Support vector machines Support vector machines (SVMs), also known as support vector networks, are a set of related supervised learning methods used for classification and regression. Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that predicts whether a new example falls into one category or the other.[69] An SVM training algorithm is a non-probabilistic, binary, linear classifier, although methods such as Platt scaling exist to use SVM in a probabilistic classification setting. In addition to performing linear classification, SVMs can efficiently perform a non-linear classification using what is called the kernel trick, implicitly mapping their inputs into high-dimensional feature spaces. Illustration of linear regression on a data set. Regression analysis Main article: Regression analysis Regression analysis encompasses a large variety of statistical methods to estimate the relationship between input variables and their associated features. Its most common form is linear regression, where a single line is drawn to best fit the given data according to a mathematical criterion such as ordinary least squares. The latter is often extended by regularization (mathematics) methods to mitigate overfitting and bias, as in ridge regression. When dealing with non-linear problems, go-to models include polynomial regression (for example, used for trendline fitting in Microsoft Excel[70]), logistic regression (often used in statistical classification) or even kernel regression, which introduces non-linearity by taking advantage of the kernel trick to implicitly map input variables to higher-dimensional space. Bayesian networks Main article: Bayesian network A simple Bayesian network. Rain influences whether the sprinkler is activated, and both rain and the sprinkler influence whether the grass is wet. A Bayesian network, belief network, or directed acyclic graphical model is a probabilistic graphical model that represents a set of random variables and their conditional independence with a directed acyclic graph (DAG). For example, a Bayesian network could represent the probabilistic relationships between diseases and symptoms. Given symptoms, the network can be used to compute the probabilities of the presence of various diseases. Efficient algorithms exist that perform inference and learning. Bayesian networks that model sequences of variables, like speech signals or protein sequences, are called dynamic Bayesian networks. Generalizations of Bayesian networks that can represent and solve decision problems under uncertainty are called influence diagrams. Genetic algorithms Main article: Genetic algorithm A genetic algorithm (GA) is a search algorithm and heuristic technique that mimics the process of natural selection, using methods such as mutation and crossover to generate new genotypes in the hope of finding good solutions to a given problem. In machine learning, genetic algorithms were used in the 1980s and 1990s.[71][72] Conversely, machine learning techniques have been used to improve the performance of genetic and evolutionary algorithms.[73] Training models Usually, machine learning models require a lot of data in order for them to perform well. Usually, when training a machine learning model, one needs to collect a large, representative sample of data from a training set. Data from the training set can be as varied as a corpus of text, a collection of images, and data collected from individual users of a service. Overfitting is something to watch out for when training a machine learning model. Federated learning Main article: Federated learning Federated learning is an adapted form of distributed artificial intelligence to training machine learning models that decentralizes the training process, allowing for users' privacy to be maintained by not needing to send their data to a centralized server. This also increases efficiency by decentralizing the training process to many devices. For example, Gboard uses federated machine learning to train search query prediction models on users' mobile phones without having to send individual searches back to Google.[74] Applications There are many applications for machine learning, including: Agriculture Anatomy Adaptive websites Affective computing Banking Bioinformatics Brain–machine interfaces Cheminformatics Citizen science Computer networks Computer vision Credit-card fraud detection Data quality DNA sequence classification Economics Financial market analysis[75] General game playing Handwriting recognition Information retrieval Insurance Internet fraud detection Linguistics Machine learning control Machine perception Machine translation Marketing Medical diagnosis Natural language processing Natural language understanding Online advertising Optimization Recommender systems Robot locomotion Search engines Sentiment analysis Sequence mining Software engineering Speech recognition Structural health monitoring Syntactic pattern recognition Telecommunication Theorem proving Time series forecasting User behavior analytics In 2006, the media-services provider Netflix held the first "Netflix Prize" competition to find a program to better predict user preferences and improve the accuracy of its existing Cinematch movie recommendation algorithm by at least 10%. A joint team made up of researchers from AT&T Labs-Research in collaboration with the teams Big Chaos and Pragmatic Theory built an ensemble model to win the Grand Prize in 2009 for $1 million.[76] Shortly after the prize was awarded, Netflix realized that viewers' ratings were not the best indicators of their viewing patterns ("everything is a recommendation") and they changed their recommendation engine accordingly.[77] In 2010 The Wall Street Journal wrote about the firm Rebellion Research and their use of machine learning to predict the financial crisis.[78] In 2012, co-founder of Sun Microsystems, Vinod Khosla, predicted that 80% of medical doctors' jobs would be lost in the next two decades to automated machine learning medical diagnostic software.[79] In 2014, it was reported that a machine learning algorithm had been applied in the field of art history to study fine art paintings and that it may have revealed previously unrecognized influences among artists.[80] In 2019 Springer Nature published the first research book created using machine learning.[81] Limitations Although machine learning has been transformative in some fields, machine-learning programs often fail to deliver expected results.[82][83][84] Reasons for this are numerous: lack of (suitable) data, lack of access to the data, data bias, privacy problems, badly chosen tasks and algorithms, wrong tools and people, lack of resources, and evaluation problems.[85] In 2018, a self-driving car from Uber failed to detect a pedestrian, who was killed after a collision.[86] Attempts to use machine learning in healthcare with the IBM Watson system failed to deliver even after years of time and billions of dollars invested.[87][88] Bias Main article: Algorithmic bias Machine learning approaches in particular can suffer from different data biases. A machine learning system trained on current customers only may not be able to predict the needs of new customer groups that are not represented in the training data. When trained on man-made data, machine learning is likely to pick up the same constitutional and unconscious biases already present in society.[89] Language models learned from data have been shown to contain human-like biases.[90][91] Machine learning systems used for criminal risk assessment have been found to be biased against black people.[92][93] In 2015, Google photos would often tag black people as gorillas,[94] and in 2018 this still was not well resolved, but Google reportedly was still using the workaround to remove all gorillas from the training data, and thus was not able to recognize real gorillas at all.[95] Similar issues with recognizing non-white people have been found in many other systems.[96] In 2016, Microsoft tested a chatbot that learned from Twitter, and it quickly picked up racist and sexist language.[97] Because of such challenges, the effective use of machine learning may take longer to be adopted in other domains.[98] Concern for fairness in machine learning, that is, reducing bias in machine learning and propelling its use for human good is increasingly expressed by artificial intelligence scientists, including Fei-Fei Li, who reminds engineers that "There’s nothing artificial about AI...It’s inspired by people, it’s created by people, and—most importantly—it impacts people. It is a powerful tool we are only just beginning to understand, and that is a profound responsibility.”[99] Model assessments Classification of machine learning models can be validated by accuracy estimation techniques like the holdout method, which splits the data in a training and test set (conventionally 2/3 training set and 1/3 test set designation) and evaluates the performance of the training model on the test set. In comparison, the K-fold-cross-validation method randomly partitions the data into K subsets and then K experiments are performed each respectively considering 1 subset for evaluation and the remaining K-1 subsets for training the model. In addition to the holdout and cross-validation methods, bootstrap, which samples n instances with replacement from the dataset, can be used to assess model accuracy.[100] In addition to overall accuracy, investigators frequently report sensitivity and specificity meaning True Positive Rate (TPR) and True Negative Rate (TNR) respectively. Similarly, investigators sometimes report the false positive rate (FPR) as well as the false negative rate (FNR). However, these rates are ratios that fail to reveal their numerators and denominators. The total operating characteristic (TOC) is an effective method to express a model's diagnostic ability. TOC shows the numerators and denominators of the previously mentioned rates, thus TOC provides more information than the commonly used receiver operating characteristic (ROC) and ROC's associated area under the curve (AUC).[101] Ethics Machine learning poses a host of ethical questions. Systems which are trained on datasets collected with biases may exhibit these biases upon use (algorithmic bias), thus digitizing cultural prejudices.[102] For example, using job hiring data from a firm with racist hiring policies may lead to a machine learning system duplicating the bias by scoring job applicants against similarity to previous successful applicants.[103][104] Responsible collection of data and documentation of algorithmic rules used by a system thus is a critical part of machine learning. Because human languages contain biases, machines trained on language corpora will necessarily also learn these biases.[105][106] Other forms of ethical challenges, not related to personal biases, are more seen in health care. There are concerns among health care professionals that these systems might not be designed in the public's interest but as income-generating machines. This is especially true in the United States where there is a long-standing ethical dilemma of improving health care, but also increasing profits. For example, the algorithms could be designed to provide patients with unnecessary tests or medication in which the algorithm's proprietary owners hold stakes. There is huge potential for machine learning in health care to provide professionals a great tool to diagnose, medicate, and even plan recovery paths for patients, but this will not happen until the personal biases mentioned previously, and these "greed" biases are addressed.[107] Hardware Since the 2010s, advances in both machine learning algorithms and computer hardware have led to more efficient methods for training deep neural networks (a particular narrow subdomain of machine learning) that contain many layers of non-linear hidden units.[108] By 2019, graphic processing units (GPUs), often with AI-specific enhancements, had displaced CPUs as the dominant method of training large-scale commercial cloud AI.[109] OpenAI estimated the hardware compute used in the largest deep learning projects from AlexNet (2012) to AlphaZero (2017), and found a 300,000-fold increase in the amount of compute required, with a doubling-time trendline of 3.4 months.[110][111] Software Software suites containing a variety of machine learning algorithms include the following: Free and open-source so

Updated 4 days ago

an-annandroidartificial-intelligence+8

predicting-Paid-amount-for-Claims-Data

Nemshan

❤️40

Introduction The context is the 2016 public use NH medical claims files obtained from NH CHIS (Comprehensive Health Care Information System). The dataset contains Commercial Insurance claims, and a small fraction of Medicaid and Medicare payments for dually eligible people. The primary purpose of this assignment is to test machine learning (ML) skills in a real case analysis setting. You are expected to clean and process data and then apply various ML techniques like Linear and no linear models like regularized regression, MARS, and Partitioning methods. You are expected to use at least two of R, Python and JMP software. Data details: Medical claims file for 2016 contains ~17 millions rows and ~60 columns of data, containing ~6.5 million individual medical claims. These claims are all commercial claims that were filed by healthcare providers in 2016 in the state of NH. These claims were ~88% for residents of NH and the remaining for out of state visitors who sought care in NH. Each claim consists of one or more line items, each indicating a procedure done during the doctor’s visit. Two columns indicating Billed amount and the Paid amount for the care provided, are of primary interest. The main objective is to predict “Paid amount per procedure” by mapping a plethora of features available in the dataset. It is also an expectation that you would create new features using the existing ones or external data sources. Objectives: Step 1: Take a random sample of 1 million unique claims, such that all line items related to each claim are included in the sample. This will result in a little less than 3 million rows of data. Step 2: Clean up the data, understand the distributions, and create new features if necessary. Step 3: Run predictive models using validation method of your choice. Step 4: Write a descriptive report (less than 10 pages) describing the process and your findings.

Jupyter Notebook

Updated 2 months ago

earthlasso-regressionpredictive-modeling+3

GP-Data_Analysis_and_ML

abdallahkhairy

🧡55

Human locomotion affects our daily living activities. Losing limbs or having neurological disorders with motor deficits could affect the quality of life. Gait analysis is a systematic study of human locomotion, which is defined as body movements through aerial, aquatic, or terrestrial space. This analysis has been used to study people ambulation, registration, and reconstruction of physical location and orientation of individual limbs used to quantify and characterize human locomotion using different gait parameters including gait activities such as walking, stairs ascending/descending, … etc., phases, and spatiotemporal parameters of human gait. Additionally, gait analysis parameters can be used to evaluate the functionality of patients and wearable system users. The evaluation is based on patient's stability, energy consumption, gait symmetry, ability to recover from perturbations, and ability to perform activities of daily living. Many companies develop assistive, wearable, and rehabilitation devices for patients with lower limb neurological disorders. These devices are tested and evaluated inside controlled lab environments. However, they don’t have enough data on the patient's performance in real world and harsh environments. Collecting large datasets of device users and their gait performance data in real environment are notoriously difficult. Additionally, collecting data on less prevalent or on gait activities other than level walking, stair ascending/descending, sitting, standing, …etc. on hard surfaces is rarely attempted. However, the scope for collecting gait data from alternative sources other than traditional gait labs could be attained with the help of IoT data collection embedded on the wearable and assistive devices and well-established cloud platforms equipped with big-data analytics and data visualization capabilities. This project aims to develop a cloud platform capable of collect data from wearable and assistive devices such as prostheses, exoskeleton, gait analysis wearable sensors, …etc. using IoT technologies. This platform is capable of automatically use data mining and visualization tools. Additionally, it uses statistical and machine learning techniques to estimate gait events, gait symmetry, gait speed, gait activities, stability, energy consumption, …etc. Also, it is capable of predicting patient's progress over time. The project will be composed of two major components, hardware component and software component. In hardware component, the students will design and implement the IoT that collects the different readings for gait analysis and send them to the cloud. Meanwhile, in the software component, the students will design and implement a set of algorithms to visualize the collected data, then design and implement data analytics to automatically analyze the collected data, so that we can estimate gait events, gait symmetry, gait speed, classify gait activities, stability, energy consumption, …etc. and predicting patient's progress over time. By analyzing the collected data, the patient's progress can be predicted over time. Additionally, these data can be used through manufacturers of prostheses legs to improve their products, as well as through health-care centers to assess the patient's performance. The following figures describe the main modules of our graduation project.

Python

Updated 2 weeks ago

VA

surayudu

❤️35

Overview Virtual Assistant is an application program that understands natural language voice commands or text commands and completes the tasks for users. Virtual Assistants features a human interface system, they can understand the language and meaning of what the user is saying and have built in replies. Learn from different instances so that they can have a long term human interaction. It uses artificial intelligence to learn things from different situations. Using AI they can recognize, predict and classify based on analysis. Purpose Virtual Assistant provides various services. It is ready to help wherever you are and can be deployed in your devices. Wider scope and perform users to get answers to their questions and perform tasks using voice or text commands, all in an interactive form. Precise voice and text recognition with the ability to have conversation with the users. In case of Google assistant, they recognize the voice of the user and perform the specific task. Use case Customer support: Rather of customers waiting for a long to solve an issue, the can get instant support from chatbot, Banking Chatbots: Personalized banking with an aim to improve customer satisfaction and engagement. Project support: Can send notifications for various tasks. Reminder to follow up with an action. HR assistants: Can help employees register time off, retrieve company policies, and find answers to repetitive employment questions. Teaching: Can helps teachers to create more detailed learning plans and materials. Being full-blown health assistants: Virtual assistants can do so much more than giving tips, they can often help patients apply simple treatments, remind them to take medicine, and monitor their health. Automating FAQs and administrative tasks: If there's a scenario where the customers have dozens of repetitive questions, virtual assistant is there 24/7 to answer questions from people who may be anxious to get answers. Technical support: The customer has a product technical error, in this case, asks the customer to type the error they encounter, then it generates a dynamic link to search the customer input words in the technical knowledge repositories and guide the customer through his search. Efficient Processes: Make processes more streamlined and transparent by synchronizing between functions, roles, and departments. Booking: A virtual assistant can respond to a consumer through messages, web, SMS or email and update them on the status of their existing reservation, make changes to the reservation, process related payments or refunds, send proactive notifications and provide detailed information on their itinerary. Features a. NLP Text Search : Virtual assistant concentrates on NLP and NLU. Understands the slang that is used in everyday conversation and analyses the sentiments to enhance a better set of communication. b. FAQ voice assistant : FAQ voice assistant is a voice assistant that provides a list of questions and answers relating to a particular subject. c. Conversations voice assistant : Conversations voice assistant is a voice assistant that provides conversational services based on a subject. d. Speech conversations (STT,TTS) : It provides conversational services such as speech to text and text to speech. e. Integration with Enterprise Systems : It provides administrative service to clients. Such as scheduling appointments, making phone calls, making travel arrangements, managing email accounts etc. f. Rich Conversations : Rich conversation is a conversation that can use different features such as images, videos, buttons, forms etc. a) Images:Imagescanbesentorreceivedduringconversations. b) Buttons:Buttonscanprovidedifferentfunctionalitiesasperthefeatureofthebutton. c) Videos:Videoscanbesentorreceivedduringconversations d) Forms: Forms help to give visible shape or configuration of something. Technical Requirement g. HTML5 h. JavaScript i. Python (Flask API, NLP Packages) j. MySQL k. Docker l. Git

Updated 6 months ago

food-risk-detection-ml

austinLorenzMccoy

❤️45

The Food Adulteration Detection System is a comprehensive data science and machine learning platform designed to identify, analyze, and predict health risks associated with food product adulteration. This system combines state-of-the-art machine learning techniques with a modern web API to provide real-time analysis and prediction capabilities.

Jupyter Notebook

Updated 1 month ago

An-algorithmic-approach-for-pollution-monitoring-and-predicting

vimalraj-76

❤️40

Over the past few decades, due to human activities, industrialization, and urbanization, air pollution has become a life-threatening factor in many countries around the world. Air pollution has a direct impact on human health through the exposure of pollutants and particulates, which has increased the interest in air pollution and its impacts among the scientific community. The main causes associated with air pollution are the burning of fossil fuels, agriculture, exhaust from factories and industries, residential heating, and natural disasters. The Environmental Protection Agency (EPA) tracks the pollution level by calculating the amount of ground-level ozone (O3), Sulphur dioxide (SO2), particulates matter (PM10 and PM2.5), carbon monoxide (CO), carbon dioxide (CO2), and nitrogen dioxide (NO2) present in the air molecule. These substances are in compositions of a common index, called the Air Quality Index (AQI), indicating how clean or polluted the air is currently or forecasted to become in areas. This work deliberates the implementation of cloud based IoT system for air quality monitoring in which the sensors are used to calculate CO, PM2.5 and PM10, O3, SO2 and NOx pollution level with environmental condition like temperature and humidity. The obtained information can be updated in cloud platform using Lora nodes and Lora Gateway. The information fetched from the cloud is transmitted to the Machine learning models which contains the detailed dataset for the pollutants and these models accurately predict the day-wise pollutant concentrations and display them using an application. This work presents the detailed analysis for predicting the cause of pollution by using Support Vector Machine (SVM), Random forest algorithm and K-nearest neighbors (KNN) algorithm.

Apache-2.0

Jupyter Notebook

Updated 7 months ago

DataMining

ioangatop

❤️40

Data Mining techniques on predicting mood using time-series analysis in mental health and predicting the click behaviour of users in a hotel ranking system.

MIT

Jupyter Notebook

Updated 2 years ago

data-miningdata-scienceprediction+3

Multiple-Disease-Prediction-System

verma-tanishq

❤️40

A lot of analysis over existing systems in the health care industry considered only one disease at a time. For example, one system is used to analysediabetes, another is used to analyse diabetes retinopathy, and another system is used to predict heart disease. Maximum systems focus on a particular disease. When an organization wants to analyse their patient’s health reports then they have to deploy many models. The approach in the existing system is useful to analyse only particular diseases. In multiple diseases prediction system a user can analyse more than one disease on a single website. The user doesn’t need to traverse different places in order to predict whether he/she has a particular disease or not. In multiple diseases prediction system, the user needs to select the name of the particular disease, enter its parameters and just click on submit. The corresponding machine learning model will be invoked and it would predict the output and display it on the screen.

MIT

Jupyter Notebook

Updated 2 years ago

Predicting-car-accidents-report.lpynb

amimba09

❤️35

ANALYZING ROAD SAFETY & TRAFFIC DEMOGRAPHICS IN THE UK (Multi-class Classification) SUMMARY Here, I am aim to analyze the Road Safety and Traffic Demographics dataset (UK), containing accidents reported by the police between the years of 2004 - 2017. PROJECT GOALS: Identify factors responsible for most of the reported accidents. Build a machine learning model that is capable of accurately predicting the severity of an accident. Provide recommendations to the Department of Transport (UK Government), to improve road safety policies and prevent recurrences of severe accidents where possible. PACKAGES USED: Scikit-learn, numpy, pandas, imblearn (imbalanced-learn), seaborn, Matplotlib MOTIVATION World Health Organization (WHO) reported that more than 1.25 million people die each year while 50 million are injured as a result of road accidents worldwide. Road accidents are the 10th leading cause of death globally. On current trends, road traffic accidents are to become the 7th leading cause of death by 2030 making it a major public health concern. Between the years 2005 and 2016, there were roughly 2 million road accidents reported in the United Kingdom (UK) alone of which 16,000 were fatal. As a big data project, I wanted to explore the traffic demographics data in greater detail using machine learning! CONTEXT The UK government amassed traffic data from 2004 to 2017, recording over 2 million accidents in the process and making this one of the most comprehensive traffic data sets out there. It's a huge picture of a country undergoing change. Note that all the contained accident data comes from police reports, so this data does not include minor incidents. For steps undertaken to pre-process and clean the data, please view the "Data Cleansing & Descriptive Analysis_UK Traffic Demographics.ipynb" file DESCRIPTIVE ANALYTICS (EDA) Tools used include Python, Tableau, MS PowerBI Percent (%) distribution of target classes Percent dist of Accident Severity As seen above, the data is highly imbalanced. For detailed steps undertaken to deal with the imbalanced data, please view the "Modelling_Predictive Analytics_UK Traffic Demographics.ipynb" file. This article provides some great tips on utilizing the correct performance metrics when analyzing a models performance trained on an imbalanced dataset. This article describes several strategies that can help combat the case of a severly imbalanced dataset. Methods include: Resampling strategies (under - Tomek Links, Cluster Centroids, over sampling - SMOTE) Using Decision Tree based models Using Cost-Sensitive training (Penalize algorithms) Number of accidents by Year and Accident Severity Total accidents by year and severity It can be seen above that the trend seems to be increasing as the years go. In addition, the spike between 2008 - 2009 was because of a enhancement in the reporting system introduced in the UK in 2009, where all accident including minor accidents needed to be reported by the police so as to match the counts represented by hospitals, insurance claims etc. Accidents density by Location geomap Most accidents took place in major cities - Birmingham, London, leeds, Newcastle Accidents by Gender and Age Accidents by gender and age Accidents by Day of the week and Year Accidents by year and weekday Most accidents take place on a Friday Vehicle Manoever at time of accident Vehicle Manoever at time of accident Most accidents take place as a result of overtaking For more findings, please go to the "Images" folder. For steps undertaken to carry out some predictive modeling and hyper-parameter tuning, please view the "Modelling_Predictive Analytics_UK Traffic Demographics.ipynb" file. RECOMMENDATIONS TO THE DEPARTMENT OF TRANSPORT (UK) Decrease emergency response times during afternoon rush-hours (15-19) especially on Fridays. Allocate resources to investigate high density traffic points and identify new infrastructure needs to divert traffic from dual-carriage ways. Explore conditions of vehicles and casualties such as vehicle type, age of vehicles registered, pedestrian movements, etc. for policy makers. Adopt comprehensive distracted driving laws that increase penalties for drivers who commit traffic violations like aggressive overtaking. ACKNOWLEDGEMENTS The license for this dataset is the Open Givernment Licence used by all data on data.gov.uk. The raw datasets are available from the UK Department of Transport website. I had a lot of fun working on this dataset and learned a lot in the process. I plan to further my research in the area of predictive modeling using imabalanced data and how to effectively build a highly robust model for future projects. About Here, I analyze the Road Safety and Traffic Demographics dataset (UK), containing accidents reported by the police between the years of 2004 - 2017. Topics accident-rate accident-severity imbalanced-data imbalanced-learning road-accident reported-accidents road-safety uk-government transport traffic-demographics severe-accidents pca classification Resources Readme Releases No releases published Packages No packages published Languages Jupyter Notebook 100.0% © 2020 GitHub, Inc.

Updated 1 year ago

Predictive-Health-System-Analysis

RudranshKaran

❤️40

A multi-agent predictive healthcare system that leverages CBC biomarker analysis and regional trend insights to detect diseases like anemia, dengue, and sepsis. It integrates patient health summaries, disease inference, and outbreak monitoring for proactive and intelligent medical response.

JavaScript

Updated 1 week ago

project_musicXmentalHealth

victorsanz-vs

❤️35

Project done in R "Music X Mental Health": Exploratory analysis, predictive models, entropy and recommendation system.

Updated 6 months ago

TimeSeriesEnergy

kishan-rajoria

❤️35

Time Series Analysis for Predictive Maintenance in electrical systems. The repository features algorithms for anomaly detection, load forecasting, and equipment health monitoring. It emphasizes the importance of predictive analytics in preventing failures and optimizing maintenance schedules.

Jupyter Notebook

Updated 1 year ago

SmartTyrePulse-trial

aayushraghav93

❤️35

SMART INDIA HACKATHON PROJECT---Smart Tyre Pulse is an AI and IoT-powered tyre monitoring system for mining dumpers, enabling real-time health tracking, TKPH analysis, and predictive maintenance.

Dart

Updated 4 months ago

Stress-Stage-Prediction-using-Machine-Learning

AyushiBhatla

❤️35

This project predicts an individual's Stress Stage (Low, Moderate, High) using Machine Learning techniques based on features like mental health scores, lifestyle habits, and demographic information. The project leverages data preprocessing, exploratory data analysis, model training, and evaluation to build a robust stress classification system.

Jupyter Notebook

Updated 9 months ago

Omdena_Sri_Lanka_Predicting_Dengue_Fever_Outbreak

surajkarki66

❤️20

As part of an Omdena project, I contributed to developing an AI-driven system to predict dengue fever outbreaks in Sri Lanka. By applying time series analysis and machine learning techniques, the project aimed to enhance early warning capabilities, enabling more efficient public health responses and potentially saving lives.

Jupyter Notebook

Updated 1 year ago

Diabetes

ferzaad

❤️35

Machinery learning is a fast-expanding area that will change the method for the diagnosis and management of this chronic condition by applying itself to diabetes as a global pandemic. Machine learning principles have been used to build algorithms to help predictive models of the likelihood of diabetes development or related complications. Digital therapy has shown to be a well-established lifestyle care intervention for diabetes control. Patients are becoming more self-managed, and the assistance of therapeutic decision-making is available to both them and health care practitioners. Machine learning helps patient signs and bio-markers to persist, unburdened, remotely controlled. Social networking and online forums also increase patient commitment to the treatment of diabetes. Development in technologies helped to optimize the use of diabetes tools. These smart technological reforms together have led to an improved glycemic regulation, a decrease in fast glucose and glycosylated hemoglobin levels. Machine learning introduces a change in diabetes treatment model from traditional management techniques to data-driven care growth The trouble with medicines is that various drug formulations can cure the condition in several ways. As the diabetic population grows, new medications are increasingly emerging. In order to treat common diseases such as elevated cholesterol and high blood pressure, diabetics also continue to take other drugs. With the patient's age and other physical conditions, the potency of these medicines varies In this method, the effectiveness, risks of side effects and costs are measured side by side, and are readily grasped by doctors and patients. The most prevalent form of Type 2 diabetes effects more people as people grow up. This disease has also escalated dramatically due to the spread of western diets and lifestyles to developing countries. Diabetes is an incurable metabolic illness that happens when high blood sugar is present, and may have deadly effects. Today, medicine, nutritious diets and exercise will regulate diabetes. It is also unpredictable to choose and administer the most appropriate mixture of prescription, which is stable, cheap and well tolerated by patients as well By applying an adequate methodology for the design and development of systems experts can achieve objectives satisfactorily, as in the case of the Weiss and Kuligowski methodology. On the other hand, machine learning has several knowledge machine algorithms, which can be useful to be applied on various data sets through the different interfaces that offers, as the option of Explorer and Datasets, which were worked in this case of study, or to be included in other applications. Furthermore, both tools, contain what is necessary to conduct data transformations, grouping, regression, clustering, correlation and visualization tasks. Because they are designed as extensibility-oriented tools which allows to add new functionalities to a project, because it can be combined with other programming languages such as Prolog, for generation more robust expert systems Readmitted diabetes patients Machine learning techniques allow to automatically identify patterns and even make predictions based on a large amount of data that could be extracted from the computer systems used to ascertain information on readmission of diabetes patients. The analysis Clustering or grouping is a technique that allows exploring a setoff objects to determine if there are groups that can be significantly represented by certain characteristics, in this way, objects of the same group are very similar to each other and different from objects in other groups. The results obtained by comparing the relevance of different attributes as well as the use of two of the most popular algorithms in the world of machine learning are presented: neural networks and decision trees. Automatic classification of blood glucose measurements will allow specialists to prescribe a more accurate treatment based on the information obtained directly from the patients' glucometer (Hosseini et al, 2020). Thus, it contributes to the development of automatic decision support systems for gestational diabetes. This high level of glucose in the blood is transferred to the fetus causing various disorders: excessive growth of adipose tissues, which increases the need for caesarean section, neonatal hypoglycemia and increased risk of intrauterine fetal death (Dagliati et al, 2018). It also increases the risk of type 2 diabetes once the gestation period is over for both the mother and the fetus. The project proposes the development of intelligent and educational tools for the survey based on neurodiffuse techniques integrated into a telemedicine system. Telemedicine systems have been used with success on numerous occasions in diabetes and the integration of decision support tools in this type of system helps a better interpretation of the data (Abhari et al, 2019).

Updated 1 year ago

this_is_my_project

shivamgupta847680-star

❤️45

Engine Health Prediction system using vibration analysis and machine learning to detect faults and predict maintenance need

HTML

Updated 1 month ago

astronaut_health_monitoring

chinmai-sd-123

🧡50

Real-time AI-powered astronaut health monitoring system featuring predictive risk analysis, anomaly detection, and live telemetry dashboards using React, Node.js, and FastAPI

MIT

JavaScript

Updated 2 months ago

Patient-Heath-Analysis-Tool

Isha-singh-01

❤️35

A comprehensive AI-driven health analysis and wellness guidance tool. The system is designed to predict potential diseases from patient health data and provide personalized lifestyle recommendations.

Jupyter Notebook

Updated 4 months ago

CardioCare

sanjai-11

❤️40

CardioCare is a heart disease management system designed to monitor, analyze, and predict heart health. By utilizing patient health records, predictive analytics, and risk analysis models, it enables medical professionals and hospital administrators to track and manage heart-related conditions effectively

MIT

HTML

Updated 11 months ago

Predictive-Healthcare-Analysis

adityat9975

❤️35

The Predictive Healthcare Data Analysis System predicts health risks for heart disease, diabetes, and Parkinson’s disease using machine learning. It categorizes risk levels as low, moderate, or high and provides personalized diet plans, exercise routines, and calorie recommendations to help users manage their health effectively.

Jupyter Notebook

Updated 3 months ago

Machine-Learning-project-

Ganesh8217

❤️35

This project predicts diabetes using machine learning. It uses a Support Vector Classifier (SVC) model trained on a health-related dataset. The notebook covers the full workflow from data analysis and preprocessing to model evaluation and a simple predictive system.

Jupyter Notebook

Updated 4 months ago

-Wireless-Stress-Detection-and-Health-Monitoring-Using-Python-and-Machine-learing

BalajiPrasadThulasiram

❤️35

Developed a stress detection system using Python and machine learning to analyze real-time physiological data such as ECG and heart rate. Designed predictive models to classify stress levels, enabling efficient health monitoring through data processing and analysis

Jupyter Notebook

Updated 6 months ago

vitalguard-ai

FirasKahlaoui

🧡50

vitalguard-ai is a smart patient monitoring system built on esp32 and iot sensors, enhanced with artificial intelligence for real-time monitoring, daily health report generation, anomaly detection, and predictive risk analysis, with optional computer vision-based fall detection.

MIT

Updated 1 month ago

artificial-intelligencebiomedical-sensorscomputer-vision+8

Neurosync

sadiya-siddiqui

🧡55

NeuroSync AI is a predictive mental health system that analyzes user mood inputs and forecasts stress levels before escalation. Unlike reactive wellness apps, it uses AI-driven sentiment analysis to detect early signs of stress and provide proactive suggestions.

JavaScript

Updated 1 week ago

VocalMind

CamiloAndresDG

❤️40

A machine learning-based system that detects depression and anxiety through voice analysis, leveraging audio spectrograms and additional features to predict emotional states. Designed for early detection and mental health support.

MIT

Python

Updated 7 months ago

LifeLink-Mobile-and-Web-application

Danuja-Dewnith

🧡65

LifeLink is an AI-integrated smart blood and organ donation system (mobile & web) that enables real-time donor-recipient matching, emergency alerts, and predictive health analysis. It leverages AI, GPS, and cloud technologies to improve response time and save lives.

Dart

Updated 5 days ago

Predictive-Analytics-System-for-Healthcare

Sodiq-Shodimu

❤️35

Developed and implemented a predictive analytics system for healthcare to forecast patient outcomes Integrated data from electronic health records (EHR) and other sources for comprehensive analysis Enhanced operational efficiency through data-driven decision-making, resulting in resource optimization and cost savings

Python

Updated 4 months ago

Multiple_Disease_Prediction

AvijitChowdhury

❤️35

Many of the existing machine learning models for health care analysis are concentrating on one disease per analysis. For one analysis is for diabetes analysis, one for cancer analysis, and one for skin diseases like that. There is no common system where one analysis can perform more than one disease prediction. This Model is for predicting those diseases.

Jupyter Notebook

Updated 3 years ago

mlnumpypandas+4

GitHub Explorer

Search Results

HeartHealthPrediction

neurons

predicting-Paid-amount-for-Claims-Data

GP-Data_Analysis_and_ML

VA

food-risk-detection-ml

An-algorithmic-approach-for-pollution-monitoring-and-predicting

DataMining

Multiple-Disease-Prediction-System

Predicting-car-accidents-report.lpynb

Predictive-Health-System-Analysis

project_musicXmentalHealth

TimeSeriesEnergy

SmartTyrePulse-trial

Stress-Stage-Prediction-using-Machine-Learning

Omdena_Sri_Lanka_Predicting_Dengue_Fever_Outbreak

Diabetes

this_is_my_project

astronaut_health_monitoring

Patient-Heath-Analysis-Tool

CardioCare

Predictive-Healthcare-Analysis

Machine-Learning-project-

-Wireless-Stress-Detection-and-Health-Monitoring-Using-Python-and-Machine-learing

vitalguard-ai

Neurosync

VocalMind

LifeLink-Mobile-and-Web-application

Predictive-Analytics-System-for-Healthcare

Multiple_Disease_Prediction

HeartHealthPrediction

neurons

predicting-Paid-amount-for-Claims-Data

GP-Data_Analysis_and_ML

VA

food-risk-detection-ml

An-algorithmic-approach-for-pollution-monitoring-and-predicting

DataMining

Multiple-Disease-Prediction-System

Predicting-car-accidents-report.lpynb

Predictive-Health-System-Analysis

project_musicXmentalHealth

TimeSeriesEnergy

SmartTyrePulse-trial

Stress-Stage-Prediction-using-Machine-Learning

Omdena_Sri_Lanka_Predicting_Dengue_Fever_Outbreak

Diabetes

this_is_my_project

astronaut_health_monitoring

Patient-Heath-Analysis-Tool

CardioCare

Predictive-Healthcare-Analysis

Machine-Learning-project-

-Wireless-Stress-Detection-and-Health-Monitoring-Using-Python-and-Machine-learing

vitalguard-ai

Neurosync

VocalMind

LifeLink-Mobile-and-Web-application

Predictive-Analytics-System-for-Healthcare

Multiple_Disease_Prediction