Summary of ENGR 323 Data Science and Analytics

This course introduces the concepts and techniques of data science and analytics. It covers the main steps of the data science process: data exploration, preparation, manipulation, and analysis. Students gain hands-on experience with Python and Pandas for data manipulation, and with statistical methods and data visualization using Matplotlib and Seaborn.

The course introduces core machine learning concepts, including supervised and unsupervised learning, regression analysis, classification, and clustering. Students work with models such as linear regression, decision trees, and random forests, and learn to evaluate model performance using a range of metrics. The course also covers time series analysis, big data technologies (Hadoop and Spark), and ethical considerations in data science, including privacy and data protection.

In a capstone project, students integrate these concepts, apply them to real-world data analysis problems, and present their findings. By the end of the course, students should be able to handle and analyze data, apply machine learning algorithms, and make decisions based on data-driven insights.
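As a rough illustration of the workflow the description outlines (prepare data, fit a model, evaluate it), the sketch below fits a one-variable linear regression and reports R^2. It is not taken from the course materials: the data values and the helper names fit_line and r_squared are made up for illustration. It uses only the Python standard library to stay self-contained, whereas the course itself works with Pandas and related libraries.

```python
from statistics import mean

# Hypothetical toy data (not from the course): hours studied and exam score.
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 55.0, 61.0, 64.0, 68.0]


def fit_line(x, y):
    """Ordinary least squares with one predictor; returns (slope, intercept)."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def r_squared(x, y, slope, intercept):
    """Share of the variance in y explained by the fitted line."""
    y_bar = mean(y)
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


slope, intercept = fit_line(hours, scores)
r2 = r_squared(hours, scores, slope, intercept)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, R^2={r2:.3f}")
```

With a real dataset, the same two steps (fit, then score) would typically be done with library routines and evaluated on data held out from fitting, rather than on the training points as in this toy case.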

Teacher: Mustafa Alattar

Skill Level: Beginner

Lecture Notes

Part 01 Introduction

Part 02 Statistics Basics

Part 03a Programming

Part 03b Python Tutorial

Part 03c NumPy Manual

Part 03d Python and NumPy

Part 04 Data Collection

Part 06 Data Exploration

Part 07 Data Visualization

Part 09 Data Manipulation

Part 10 Correlation

Part 11 Modeling

Part 12 Linear Regression Examples

Part 13 Model Calibration and Validation

Part 14 Sensitivity Analysis

Part 14b Built-in Functions

Part 15 Bias and Variance

Part 16 Multiple Linear Regression

Part 17 Quantile Regression

Part 18 Probability and Odds

Part 19 Logistic Regression

Part 20 Data Structure

Part 21 Artificial Intelligence All

Part 22 Machine Learning

Part 23 Classification

Part 24 KNN

Part 25 K-means Clustering

Part 26 Decision Tree and Random Forest

Part 27 Neural Network

Lecture Video Recordings

In-class Submissions

Project template

1_Example Python Delimited values

2_Practice File Excel Question

2_Practice File Excel Solution

March 31 Submission (Deadline April 6)

April 2 Submission (Deadline April 7)

April 5 Submission (Deadline April 7)

April 5 Data

April 7 Submission (Deadline April 9)

April 9 Submission (Deadline April 12)

April 12 Midterm Exam 01 - Part 01 Submission

April 14 Midterm Exam 01 - Part 02 Submission

Midterm Exam 01 Grades

Midterm Exam 01 Part 01 Solution

Midterm Exam 01 Part 02 Solution

April 28 Submission (Deadline April 30)

8_Example_01_linear_regression

May_3_submission (Deadline: May 3)

9_nonlinear_regression

Optional: Project (Deadline: May 17)

Review

Exam 02 Submission (Computer Based)

Meteorological data

Exam 02 Grades

Pre-final Grades
