Feature importance: Churn rate and risk ratio Feature importance analysis is a part of exploratory data analysis (EDA) and involves identifying which features affect our target variable. Churn rate Last time, we examined the global churn rate. Now, we are focusing on the churn rate within different groups. For example, we are interested in determiningContinue reading “ML Zoomcamp 2023 – Machine Learning for Classification– Part 5”
Tag Archives: Classification
ML Zoomcamp 2023 – Machine Learning for Classification– Part 4
EDA – Exploratory Data Analysis The topics that we cover in this section are: Checking missing values The following snippet indicates that the dataset ‘df_full_train’ contains no missing values: Looking at the target variable (churn) First what we can check is the distribution of our target variable ‘churn’. How many customers are churning and howContinue reading “ML Zoomcamp 2023 – Machine Learning for Classification– Part 4”
ML Zoomcamp 2023 – Machine Learning for Classification– Part 3
Setting up the validation framework Perform the train/validation/test split with Scikit-Learn You can utilize the train_test_split function from the sklearn.model_selection package to automate the splitting of your data into training, validation, and test sets. Before you can use it, make sure to import it first as follows: The train_test_split function divides the dataframe into twoContinue reading “ML Zoomcamp 2023 – Machine Learning for Classification– Part 3”
ML Zoomcamp 2023 – Machine Learning for Classification– Part 2
Data preparation The topics that we cover in this section are: Downloading the data First, we import all the necessary packages. Then, we can download our CSV file using the ‘wget’ command. When using Jupyter Notebook, it’s important to note that ‘!’ indicates the execution of a shell command, and the ‘$’ symbol, as seenContinue reading “ML Zoomcamp 2023 – Machine Learning for Classification– Part 2”
ML Zoomcamp 2023 – Machine Learning for Classification– Part 1
Churn prediction project In this project, we’re focusing on churn prediction for a telecommunications company. Consider a telecommunications company with a diverse customer base. Some of these customers are satisfied with the services they receive, while others are not. Those who are dissatisfied are contemplating terminating their contracts and switching to another service provider. WhatContinue reading “ML Zoomcamp 2023 – Machine Learning for Classification– Part 1”