https://ivanstudyblog.github.io/ Hello :) After 220 days posting here, I am moving my blog to https://ivanstudyblog.github.io/ Please head onto there for the latest days The UI there is a bit more flexible and customisable so I will continue my learning journey there. That is all for today :) See you in the new blog.
Hello :) Today is Day 182! A quick summary of today: learning about IV, WoE, and finding a best model for an imbalanced insurance fraud imbalanced dataset The time has come to start thinking about the project for MLOps zoomcamp. I was looking around for some interesting dataset related to PD (probability of default) or LGD (loss given default) or EAD (exposure at default), and I found this notebook. Warning - it is fairly long. But inside I saw something that interested me - it talked about WoE and IV. It says that they are good estimators for evaluating features for fraud and similar classification tasks. This website's definition was the most clear. Weight of Evidence (WoE) It is a technique used in credit scoring and predictive modeling to assess the predictive power of independent variables relative to a dependent variable. Originating from the credit risk world, WoE measures the separation between "good" and "bad" customers. Here, "bad" custom...
Hello :) Today is Day 173! A quick summary of today: learned more about terraform and how to set up a GCP VM and connect to it locally used mage for some data engineering pipelines with GCP Last videos from Module 1: terraform variables , GCP set up Turns out there is a bit more of terraform from the data eng zoomcamp, and today I covered it. After learning how to connect to gcp using terraform and create a storage bucket, the first thing today was creating a bigquery dataset Adding the above to main.tf which now looks like: terraform apply, creates a demo_dataset as well Then I learned about variables in terraform Create a variables.tf file and put a variable like: and in main.tf we can directly use the created variables like: Great intro to terraform - being able to define infrastructure code, create resources, and destroy resources. The next part was an instruction on setting up GCP (cloud VM + SSH access) First was creating an ssh key locally And add it to the metadata in GCP'...