Found 1,373 repositories (showing 30)
A few projects related to Data Engineering, including Data Modeling, infrastructure setup on the cloud, Data Warehousing, and Data Lake development.
Projects completed in the Data Engineering Nanodegree by Udacity.com
immu0001
Classwork projects and homework completed through the Udacity Data Engineering Nanodegree
manuel-lang
Solutions to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark, and Data Pipelines with Airflow.
Udacity Data Engineering Nanodegree Capstone Project
My solutions for the Udacity Data Engineering Nanodegree
Lal4Tech
Resources and projects from the Udacity Data Engineering with AWS Nanodegree program
AhmadChaiban
Udacity's five-month Data Engineering Nanodegree program; this repo includes all completed projects.
anefischer
Projects I implemented to complete Udacity Nanodegree programs, from Data Engineering to Machine Learning Engineering.
kenhanscombe
Udacity Data Engineering Nanodegree project
amalphonse
This repo contains my projects from the Udacity Data Engineering Nanodegree
scurtis94
A few projects related to Data Engineering, including Data Modeling, infrastructure setup on the cloud, Data Warehousing, and Data Lake development.
bondxue
🍄 Udacity Data Engineering Nanodegree Project 3
patrickbrus
A Machine Learning project for retail data analytics as part of the Machine Learning Engineering Nanodegree Capstone Project from Udacity
Udacity Data Engineering Nanodegree Projects
Project Overview

Welcome to the Convolutional Neural Networks (CNN) project in the AI Nanodegree! In this project, you will learn how to build a pipeline that can be used within a web or mobile app to process real-world, user-supplied images. Given an image of a dog, your algorithm will identify an estimate of the canine's breed. If supplied an image of a human, the code will identify the resembling dog breed. (Sample output image omitted.)

Along with exploring state-of-the-art CNN models for classification, you will make important design decisions about the user experience for your app. Our goal is that by completing this lab, you understand the challenges involved in piecing together a series of models designed to perform various tasks in a data processing pipeline. Each model has its strengths and weaknesses, and engineering a real-world application often involves solving many problems without a perfect answer. Your imperfect solution will nonetheless create a fun user experience!

Project Instructions

1. Clone the repository and navigate to the downloaded folder.

       git clone https://github.com/udacity/dog-project.git
       cd dog-project

2. Download the dog dataset. Unzip the folder and place it in the repo, at location path/to/dog-project/dogImages.

3. Download the human dataset. Unzip the folder and place it in the repo, at location path/to/dog-project/lfw. If you are using a Windows machine, you are encouraged to use 7zip to extract the folder.

4. Download the VGG-16 bottleneck features for the dog dataset. Place them in the repo, at location path/to/dog-project/bottleneck_features.

5. (Optional) If you plan to install TensorFlow with GPU support on your local machine, follow the guide to install the necessary NVIDIA software on your system. If you are using an EC2 GPU instance, you can skip this step.

6. (Optional) If you are running the project on your local machine (and not using AWS), create (and activate) a new environment.

   Linux (to install with GPU support, change requirements/dog-linux.yml to requirements/dog-linux-gpu.yml):

       conda env create -f requirements/dog-linux.yml
       source activate dog-project

   Mac (to install with GPU support, change requirements/dog-mac.yml to requirements/dog-mac-gpu.yml):

       conda env create -f requirements/dog-mac.yml
       source activate dog-project

   NOTE: Some Mac users may need to install a different version of OpenCV:

       conda install --channel https://conda.anaconda.org/menpo opencv3

   Windows (to install with GPU support, change requirements/dog-windows.yml to requirements/dog-windows-gpu.yml):

       conda env create -f requirements/dog-windows.yml
       activate dog-project

7. (Optional) If you are running the project on your local machine (and not using AWS) and Step 6 throws errors, try this alternative step to create your environment.

   Linux or Mac (to install with GPU support, change requirements/requirements.txt to requirements/requirements-gpu.txt):

       conda create --name dog-project python=3.5
       source activate dog-project
       pip install -r requirements/requirements.txt

   NOTE: Some Mac users may need to install a different version of OpenCV:

       conda install --channel https://conda.anaconda.org/menpo opencv3

   Windows (to install with GPU support, change requirements/requirements.txt to requirements/requirements-gpu.txt):

       conda create --name dog-project python=3.5
       activate dog-project
       pip install -r requirements/requirements.txt

8. (Optional) If you are using AWS, install TensorFlow:

       sudo python3 -m pip install -r requirements/requirements-gpu.txt

9. Switch the Keras backend to TensorFlow.

   Linux or Mac:

       KERAS_BACKEND=tensorflow python -c "from keras import backend"

   Windows:

       set KERAS_BACKEND=tensorflow
       python -c "from keras import backend"

10. (Optional) If you are running the project on your local machine (and not using AWS), create an IPython kernel for the dog-project environment:

        python -m ipykernel install --user --name dog-project --display-name "dog-project"

11. Open the notebook:

        jupyter notebook dog_app.ipynb

12. (Optional) If you are running the project on your local machine (and not using AWS), before running code, change the kernel to match the dog-project environment by using the drop-down menu (Kernel > Change kernel > dog-project). Then, follow the instructions in the notebook.

NOTE: While some code has already been implemented to get you started, you will need to implement additional functionality to successfully answer all of the questions included in the notebook. Unless requested, do not modify code that has already been included.

Evaluation

Your project will be reviewed by a Udacity reviewer against the CNN project rubric. Review this rubric thoroughly, and self-evaluate your project before submission. All criteria found in the rubric must meet specifications for you to pass.

Project Submission

When you are ready to submit your project, collect the following files and compress them into a single archive for upload:

* The dog_app.ipynb file with fully functional code, all code cells executed and displaying output, and all questions answered.
* An HTML or PDF export of the project notebook with the name report.html or report.pdf.
* Any additional images used for the project that were not supplied to you for the project.

Please do not include the project datasets in the dogImages/ or lfw/ folders. Likewise, please do not include the bottleneck_features/ folder.
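The bottleneck-feature step above lends itself to a short illustration. Below is a minimal sketch of the transfer-learning approach the notebook builds toward: training a small classifier head on precomputed VGG-16 features. The file name `DogVGG16Data.npz`, the `train`/`valid` keys, and the 133-breed output size are assumptions based on the Udacity dog dataset, not details confirmed by this listing.

```python
# Hedged sketch: fit a small classifier head on precomputed VGG-16 bottleneck
# features, as the project instructions describe. File name, array keys, and
# the 133-class output size are assumptions about the Udacity dataset.
import numpy as np
from keras.models import Sequential
from keras.layers import GlobalAveragePooling2D, Dense

bottleneck = np.load('bottleneck_features/DogVGG16Data.npz')  # assumed file name
train_vgg16, valid_vgg16 = bottleneck['train'], bottleneck['valid']

model = Sequential([
    GlobalAveragePooling2D(input_shape=train_vgg16.shape[1:]),
    Dense(133, activation='softmax'),  # assumed: 133 dog breeds in the dataset
])
model.compile(loss='categorical_crossentropy', optimizer='rmsprop',
              metrics=['accuracy'])

# train_targets / valid_targets would be one-hot breed labels loaded elsewhere:
# model.fit(train_vgg16, train_targets,
#           validation_data=(valid_vgg16, valid_targets),
#           epochs=20, batch_size=32)
```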
Udacity Data Engineering Nanodegree project: data modeling for fact and dimension tables, and an ETL pipeline that transfers data from files in two local directories into these tables in Postgres using Python and SQL (see the sketch below).
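As a hedged illustration of the kind of ETL step this description names, here is a minimal sketch that reads one local JSON file and upserts a record into a Postgres dimension table. The database name, table, columns, and file path are placeholders, not taken from the repo.

```python
# Minimal sketch of a file-to-Postgres ETL step, assuming placeholder names
# (sparkifydb database, songs table, sample file path).
import json
import psycopg2

conn = psycopg2.connect(
    "host=127.0.0.1 dbname=sparkifydb user=student password=student"
)
cur = conn.cursor()

with open('data/song_data/sample_song.json') as f:  # hypothetical path
    record = json.load(f)

# Insert one row into a dimension table; skip duplicates on the primary key.
cur.execute(
    """
    INSERT INTO songs (song_id, title, artist_id, year, duration)
    VALUES (%s, %s, %s, %s, %s)
    ON CONFLICT (song_id) DO NOTHING;
    """,
    (record['song_id'], record['title'], record['artist_id'],
     record['year'], record['duration']),
)
conn.commit()
conn.close()
```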
lucaskjaero
Projects submitted as part of working through Udacity's Data Engineering Nanodegree.
Capstone Project for Udacity Data Engineering Nanodegree
BarbaraJoebstl
Projects of the Udacity Data Engineering Nanodegree Program.
Udacity Data Engineering Nanodegree Capstone Project
write4alive
Udacity Data Engineering Nanodegree program, Project 5: Data Pipelines with Apache Airflow (a minimal DAG sketch follows).
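To make the "Data Pipelines with Apache Airflow" description concrete, here is a minimal, self-contained DAG sketch of the stage-then-verify pattern such pipelines typically use. It assumes Airflow 2.x import paths; the DAG id, task names, and schedule are illustrative only.

```python
# Hedged sketch of a two-task Airflow pipeline: stage data, then run quality
# checks. Assumes Airflow 2.x; all names here are placeholders.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def stage_data():
    # In a real pipeline this would copy source files into staging tables.
    print("staging source files")

def run_quality_checks():
    # In a real pipeline this would verify row counts and null constraints.
    print("running data quality checks")

with DAG(
    dag_id="example_etl_pipeline",      # illustrative name
    start_date=datetime(2023, 1, 1),
    schedule_interval="@hourly",
    catchup=False,
) as dag:
    stage = PythonOperator(task_id="stage_data", python_callable=stage_data)
    check = PythonOperator(task_id="quality_checks",
                           python_callable=run_quality_checks)
    stage >> check  # quality checks run only after staging succeeds
```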
polo2444172276
Completed Udacity's Data Engineering Nanodegree: a series of exercises and projects to learn and practice popular big data management tools.
MariamGado0
# Starbucks Promotions Project

This project is the Capstone Project of Udacity's Machine Learning Engineering Nanodegree program.

## Problem Statement

This dataset contains simulated data that mimics customer behavior on the Starbucks rewards mobile app. Starbucks collects customer data to understand behavior around the rewards and offers sent via the app. Once every few days, Starbucks sends out a personalized offer to users of the mobile app. An offer can be merely an advertisement for a drink or an actual offer such as a discount or BOGO (buy one, get one free). Some users might not receive any offer during certain weeks, customers can respond positively, negatively, or neutrally, and not all users receive the same offer; that is the challenge to solve with this dataset. The task of this project is to combine past transaction, demographic, and offer data (which is already provided) to determine which demographic groups respond best to which offer types. This dataset is a simplified version of the real Starbucks app, because the underlying simulator only has one product whereas Starbucks actually sells dozens of products.

In order to develop this project, we needed to use some tools, packages, systems, and services that could help us achieve our goals.

#### Libraries

First of all, we used **Python** (version 3.6) to write our scripts, not only for algorithm training and serving but also for the orchestration of the whole process. You will need to install the following libraries in order to run the code:

* `pandas` so we could work with tabular data in dataframes;
* `plotly` and `matplotlib` for dataset visualization;
* `numpy` so we could easily manipulate arrays and data structures;
* `seaborn` and `matplotlib` so we could generate insightful visualizations;
* `sklearn` so we could build and develop our model pipeline;
* `imblearn` so we could apply SMOTE to our training data;
* `xgboost` so we could have our main classifier;
* `sagemaker` so we could easily interact with AWS;
* `json` for reading our dataset files;
* `boto3`.

Finally, we used the AWS environment to launch training jobs, deploy our model, and serve predictions. The main services used are:

* __AWS SageMaker__: training, hyperparameter tuning, and endpoint serving;
* __Amazon S3__: saving our data and model artifacts.

## Files Descriptions

This project is structured as follows:

#### 01. Proposal
Project proposal documentation.

#### 02. Data_Cleaning_[Dataset]
Folder for data preparation and dataset cleaning, producing the final data for further use in the model algorithms.

#### 03. Pre-processing Dataset Visualization
Folder for final pre-processing of the dataset to be used in visualization and exploration.

#### 04. Dataset_Visualization
Folder containing visualizations of the pre-processed dataset.

#### 06. ORG_Starbucks_Capstone_Project.ipynb
Jupyter notebook that deploys the final model, creates an endpoint, and orchestrates the end-to-end process in AWS SageMaker, interacting with the other services.
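Given the libraries this README names (`imblearn` for SMOTE, `xgboost` as the main classifier), here is a hedged sketch of the modeling core, leaving the SageMaker deployment aside. The feature matrix and target are random placeholders; the real notebook's columns and preprocessing will differ.

```python
# Hedged sketch of the SMOTE + XGBoost approach the README's library list
# implies: oversample the minority class inside an imblearn pipeline, then fit
# the classifier. Data here is synthetic placeholder input.
import numpy as np
from imblearn.over_sampling import SMOTE
from imblearn.pipeline import Pipeline
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

X = np.random.rand(500, 8)             # placeholder demographic/offer features
y = np.random.randint(0, 2, 500)       # placeholder "responded to offer" label

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

clf = Pipeline([
    ("smote", SMOTE(random_state=42)),         # applied to training folds only
    ("xgb", XGBClassifier(eval_metric="logloss")),
])
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```

Using `imblearn.pipeline.Pipeline` (rather than sklearn's) matters here: it applies SMOTE only during `fit`, so the test split is never resampled.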
cheuklau
Udacity Data Engineering Nanodegree Airflow project
danielmt
Udacity Data Engineering Nanodegree Project 1 - Data Modeling with Postgres
naderAsadi
Data Engineering Nanodegree projects and exercises, including Data Modeling, Data Warehousing, Data Lake development, and Pipeline Management.
Wathon
Udacity Data Engineering Nanodegree project: ETL for a data warehouse using S3 and Amazon Redshift (see the sketch below).
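The S3-to-Redshift ETL this description names usually centers on a SQL `COPY` from S3 into a staging table, run over a standard Postgres-protocol connection. Below is a hedged sketch; the cluster endpoint, bucket, table, IAM role ARN, and credentials are all placeholders.

```python
# Hedged sketch of the core Redshift load step: COPY JSON logs from S3 into a
# staging table via psycopg2. Every identifier below is a placeholder.
import psycopg2

COPY_STAGING_EVENTS = """
    COPY staging_events
    FROM 's3://example-bucket/log_data'
    IAM_ROLE 'arn:aws:iam::123456789012:role/myRedshiftRole'
    FORMAT AS JSON 'auto'
    REGION 'us-west-2';
"""

conn = psycopg2.connect(
    "host=example-cluster.abc123.us-west-2.redshift.amazonaws.com "
    "dbname=dev user=awsuser password=change-me port=5439"  # placeholders
)
cur = conn.cursor()
cur.execute(COPY_STAGING_EVENTS)  # Redshift loads directly from S3 in parallel
conn.commit()
conn.close()
```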
Federico-abss
My projects for the Udacity Data Engineering Nanodegree
Project 4: Udacity Nanodegree Program - Data Engineering with Microsoft Azure