Overview: Introduction to Pandas – Part 2/2 Filtering Filtering refers to the process of selecting specific rows or columns from a DataFrame based on certain conditions. In Pandas, we can use various techniques to filter our data. One common technique is to use boolean indexing. This involves creating a boolean mask that specifies the conditionsContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 13”
Category Archives: Introduction
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 12
Overview: Introduction to Pandas – Part 1/2 The last part of the Introduction to ML covers the Python package Pandas. Pandas is a powerful and versatile data manipulation library in Python. It provides data structures and functions that are essential for data analysis and preprocessing tasks. With Pandas, you can easily load, manipulate, and analyzeContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 12”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 11
Overview: Linear algebra refresher – Part 3/3 This is the third and last part of the refresher and covers special matrix types and its implementation in Python. Special matrix types Identity matrix An identity matrix is a special type of square matrix in linear algebra. It is denoted as I and has ones along itsContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 11”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 10
Overview: Linear algebra refresher – Part 2/3 This is the second part of the refresher and covers more sophisticated operations and its implementation in Python. Vector vector multiplication (dot product) The dot product, also known as the scalar product, is a key operation in linear algebra. It involves multiplying the corresponding components of two vectorsContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 10”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 9
Overview: This part is a linear algebra refresher. There are different operations you need to understand for ML. Actually, you don’t really need to know that if you know what you’re doing in the code. However, a basic understanding can’t do any harm at this point. Because this article has become quite long, I decidedContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 9 “
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 8
Overview: Introduction to NumPy part 3/3 Randomly generated arrays In addition to creating and manipulating multi-dimensional arrays, NumPy also provides the capability to generate arrays filled with random values. This can be useful in various scientific and mathematical applications, as well as for testing and simulation purposes. To create a randomly generated array, you canContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 8”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 7
Overview: Introduction to NumPy part 2/3 Multi-dimensional arrays One of the key advantages of NumPy is its ability to efficiently work with multi-dimensional arrays. These arrays can have any number of dimensions, from one-dimensional arrays (vectors) to two-dimensional arrays (matrices) to higher-dimensional arrays. To create a multi-dimensional array in NumPy, you can use the numpy.array()Continue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 7”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 6
Overview: Introduction to NumPy part 1/3 Numpy is a powerful Python library that provides support for large, multi-dimensional arrays and matrices, along with a wide range of mathematical functions to operate on these arrays. By importing the NumPy module as np, we gain access to all of its functionality and can easily manipulate arrays inContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 6”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 5
Overview: In my last post (Introduction to Machine Learning – Part 4) I wrote about the CRISP-DM ML Process with its six steps. This post is about the 4th step of that process – the modeling step. This means that the model selection process is also relevant here. Model Selection Process Imagine there is a modelContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 5”
ML Zoomcamp 2023 – Introduction to Machine Learning – Part 4
Overview: CRISP-DM Machine Learning Process This part is about the CRISP-DM Machine Learning Process (Cross-industry standard process for data mining). Methodologies like CRISP-DM help us to organize the ML project in a way that is manageable (what needs to happen in which order). Figure 1.4.1 is from Wikipedia. You can find more information on thatContinue reading “ML Zoomcamp 2023 – Introduction to Machine Learning – Part 4”