Machine Learning | 2023-10-14
Machine learning models (including language models) can be easily deployed using generous free tier on Hugging Face and a python-based open source UI tool Gradio by following these steps. See live deployed app and source code here For local...
Machine Learning | 2022-05-07
Contents Algorithms Bayes Explainability MLOps Model Evaluation Preprocessing Reinforcement Learning SQL Statistics Algorithms K-means: aims to choose centroids that minimize the inertia, or within-cluster sum-of-squares criterion. Use the...
Machine Learning | 2021-12-18
Contents Summary Code Summary The purpose of this post is to propose a template for machine learning projects that strives to follow these principles: All data scientists can quickly setup an identical development environment based on Docker that...
Machine Learning | 2021-11-13
Contents Summary Notebook Summary Keras LSTM can be a powerful tool for forecasting. Below is a simple template notebook showing how to setup a data science forecasting experiment. Dataset A synthetic dataset was generated using a scikit-learn...
Machine Learning | 2021-08-30
Contents Summary Notebook Summary In general, a machine learning pipeline should have the following characteristics: To ensure data consistency, the pipeline should include every step (such as feature engineering) required to train and score...
Machine Learning | 2021-05-16
In this article I will leverage the global temperate dataset I discussed previously to make a temperature forecast using Facebook Prophet for the next 50 years. Note: the temperature dataset serves ONLY as a vehicle to learn how to do forecasting...
Random | 2021-05-14
Berkeley Earth publishes an unique dataset with global temperature measurements. Below is a guide to the download the data and start analyzing it using Python. All code can be found in this gist. Download .txt file from Berkeley Earth data...
Machine Learning | 2021-03-27
This article is an extension of my previous article describing a similar deployment process using native AWS Lambda tools. However, Amazon since started supporting container images and updated it’s pricing policy to 1ms granularity. Both are...
Machine Learning | 2021-03-20
This article is a follow up to my previous tutorial on how to setup Google Colab and auto-sklean. Here, I will go into more detail that shows auto-sklearn performance on an artificially created dataset. The full notebook gist can be found here....
Machine Learning | 2020-12-04
Auto ML is fast becoming a popular solution to build minimal viable models for new projects. A popular library for Python is Auto-sklearn that leverages the most popular Python ML library scikit-learn. Auto-sklearn runs a smart search over...