ML Zoomcamp 2023 – Machine Learning for Regression – Part 2

Exploratory data analysis (EDA) – General Information Exploratory data analysis (EDA) is an essential step in the data analysis process. It involves summarizing and visualizing the main characteristics of a dataset to gain insights and identify patterns or trends. By exploring the data, researchers can uncover hidden relationships between variables and make informed decisions. OneContinue reading “ML Zoomcamp 2023 – Machine Learning for Regression – Part 2”

ML Zoomcamp 2023 – Machine Learning for Regression – Part 1

Car price prediction project This chapter is about the implementation of an ML project and which steps have to be considered. It is about the prediction of car prices based on a Kaggle dataset. The steps will be described in individual blog posts and include: Data preparation – General Information In the data preparation phase,Continue reading “ML Zoomcamp 2023 – Machine Learning for Regression – Part 1”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 13

Overview: Introduction to Pandas – Part 2/2 Filtering Filtering refers to the process of selecting specific rows or columns from a DataFrame based on certain conditions. In Pandas, we can use various techniques to filter our data. One common technique is to use boolean indexing. This involves creating a boolean mask that specifies the conditionsContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 13”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 12

Overview: Introduction to Pandas – Part 1/2 The last part of the Introduction to ML covers the Python package Pandas. Pandas is a powerful and versatile data manipulation library in Python. It provides data structures and functions that are essential for data analysis and preprocessing tasks. With Pandas, you can easily load, manipulate, and analyzeContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 12”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 11

Overview: Linear algebra refresher – Part 3/3 This is the third and last part of the refresher and covers special matrix types and its implementation in Python. Special matrix types Identity matrix An identity matrix is a special type of square matrix in linear algebra. It is denoted as I and has ones along itsContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 11”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 10

Overview: Linear algebra refresher – Part 2/3 This is the second part of the refresher and covers more sophisticated operations and its implementation in Python. Vector vector multiplication (dot product) The dot product, also known as the scalar product, is a key operation in linear algebra. It involves multiplying the corresponding components of two vectorsContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 10”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 9 

Overview: This part is a linear algebra refresher. There are different operations you need to understand for ML. Actually, you don’t really need to know that if you know what you’re doing in the code. However, a basic understanding can’t do any harm at this point. Because this article has become quite long, I decidedContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 9 “

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 8

Overview: Introduction to NumPy part 3/3 Randomly generated arrays In addition to creating and manipulating multi-dimensional arrays, NumPy also provides the capability to generate arrays filled with random values. This can be useful in various scientific and mathematical applications, as well as for testing and simulation purposes. To create a randomly generated array, you canContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 8”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 7

Overview: Introduction to NumPy part 2/3 Multi-dimensional arrays One of the key advantages of NumPy is its ability to efficiently work with multi-dimensional arrays. These arrays can have any number of dimensions, from one-dimensional arrays (vectors) to two-dimensional arrays (matrices) to higher-dimensional arrays. To create a multi-dimensional array in NumPy, you can use the numpy.array()Continue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 7”

ML Zoomcamp 2023 – Introduction to Machine Learning – Part 6

Overview: Introduction to NumPy part 1/3 Numpy is a powerful Python library that provides support for large, multi-dimensional arrays and matrices, along with a wide range of mathematical functions to operate on these arrays. By importing the NumPy module as np, we gain access to all of its functionality and can easily manipulate arrays inContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 6”