Found 8 repositories(showing 8)
# Breast-cancer-risk-prediction > Necessity, who is the mother of invention. – Plato* ## Welcome to my GitHub repository on Using Predictive Analytics model to diagnose breast cancer. --- ### Objective: The repository is a learning exercise to: * Apply the fundamental concepts of machine learning from an available dataset * Evaluate and interpret my results and justify my interpretation based on observed data set * Create notebooks that serve as computational records and document my thought process. The analysis is divided into four sections, saved in juypter notebooks in this repository 1. Identifying the problem and Data Sources 2. Exploratory Data Analysis 3. Pre-Processing the Data 4. Build model to predict whether breast cell tissue is malignant or Benign ### [Notebook 1](https://github.com/ShiroJean/Breast-cancer-risk-prediction/blob/master/NB1_IdentifyProblem%2BDataClean.ipynb): Identifying the problem and Getting data. **Notebook goal:Identify the types of information contained in our data set** In this notebook I used Python modules to import external data sets for the purpose of getting to know/familiarize myself with the data to get a good grasp of the data and think about how to handle the data in different ways. ### [Notebook 2](https://github.com/ShiroJean/Breast-cancer-risk-prediction/blob/master/NB2_ExploratoryDataAnalysis.ipynb) Exploratory Data Analysis **Notebook goal: Explore the variables to assess how they relate to the response variable** In this notebook, I am getting familiar with the data using data exploration and visualization techniques using python libraries (Pandas, matplotlib, seaborn. Familiarity with the data is important which will provide useful knowledge for data pre-processing) ### [Notebook 3](https://github.com/ShiroJean/Breast-cancer-risk-prediction/blob/master/NB3_DataPreprocesing.ipynb) Pre-Processing the data **Notebook goal:Find the most predictive features of the data and filter it so it will enhance the predictive power of the analytics model.** In this notebook I use feature selection to reduce high-dimension data, feature extraction and transformation for dimensionality reduction. This is essential in preparing the data before predictive models are developed. ### [Notebook 4](https://github.com/ShiroJean/Breast-cancer-risk-prediction/blob/master/NB4_PredictiveModelUsingSVM.ipynb) Predictive model using Support Vector Machine (svm) **Notebook goal: Construct predictive models to predict the diagnosis of a breast tumor.** In this notebook, I construct a predictive model using SVM machine learning algorithm to predict the diagnosis of a breast tumor. The diagnosis of a breast tumor is a binary variable (benign or malignant). I also evaluate the model using confusion matrix the receiver operating curves (ROC), which are essential in assessing and interpreting the fitted model. ### [Notebook 5](https://github.com/ShiroJean/Breast-cancer-risk-prediction/blob/master/NB_5%20OptimizingSVMClassifier.ipynb): Optimizing the Support Vector Classifier **Notebook goal: Construct predictive models to predict the diagnosis of a breast tumor.** In this notebook, I aim to tune parameters of the SVM Classification model using scikit-learn.
hrishitelang
Cancer is a collection of related diseases, in which some of the body’s cells begin to divide without stopping and spread into surrounding tissues. Regardless of the view of cancer may be, it is exaggerated and over-generalized. While a diagnosis of cancer may still leave patients feeling helpless and out of control, in many cases today there is cause for hope rather than a blinkered vision of survival. The basic aim of our project is to ensure that patients with a risk or borderline edge of getting cancer shall get themselves digitally scanned, that would eventually generate a report. This report shall achieve in alluding convoluted details regarding certain possible properties of tumours that could be sent for prediction so that they could immediately diagnose it if at all it is predicted to be malignant. The importance of classifying cancer patients into high or low-risk groups has led many research teams, from the biomedical and the bioinformatics field, to study the application of machine learning (ML) methods. Therefore, these techniques have been utilized as an aim to model the progression and treatment of cancerous conditions. In addition, the ability of ML tools to detect key features from complex datasets reveals their importance. Support vector machine has become an increasingly popular tool for machine learning tasks involving classification, regression or novelty detection. Training a support vector machine requires the solution of a very large quadratic programming problem. Up to now, several approaches exist for circumventing the above shortcomings and work well with the dataset. And besides, till now the project has confined its attempt to diagnose breast cancer only. In this way, we can affirm that the prognosis of cancer can be achieved, and accordingly, we can produce outputs for the same.
HumiraSaria
No description available
kelvin482
A Django web application for breast cancer diagnosis prediction using machine learning and the Wisconsin Breast Cancer Dataset (WBCD). Features user authentication, doctor/patient interfaces, and ML-powered risk assessment.
Breast Cancer Dataset for Machine Learning This repository contains a labeled breast cancer dataset designed for classification and predictive modeling tasks in medical data analysis. The dataset is suitable for machine learning and deep learning approaches focused on breast cancer diagnosis and risk prediction.
Maximus2k04
The Breast Cancer Prediction System is a machine learning-based solution designed to predict the risk of breast cancer by analyzing patient data, including imaging results, genetic history, and clinical information. This system aims to assist healthcare professionals in early diagnosis and personalized treatment planning
Shubham06032003
AI-powered web app for breast cancer risk prediction. User-friendly interface, real-time diagnosis probability, radar chart visualization, and key medical metric support. Built with Python, Streamlit, scikit-learn, and Plotly. Educational use only.
This project predicts diabetes, breast cancer, and heart disease using machine learning algorithms on medical datasets. It applies models like Logistic Regression and Random Forest to support early diagnosis and risk prediction based on patient health data.
All 8 repositories loaded