20 Nov 2021

Datasnips is a platform for you to create, save and organise your data science code snippets and discover those created by other users.

04 Sep 2021

In this tutorial series, we will look at how to get started with using Python and Matplotlib to visualise our data. In this first part, we'll take you through the basics of how to create a simple line chart and how to customise and format it, while working with instances of Matplotlib's figure and axes objects.

30 Aug 2021

In this tutorial we'll see how we can use the Keras ImageDataGenerator class from TensorFlow to help create a model for classifying images. We'll be using the ImageDataGenerator to preprocess our images and also to feed them into the model using its flow_from_dataframe method.

10 Aug 2021

OK, here we go, I’ll stick my head above the parapet. There’s a debate that’s been gathering pace for a while now: a backlash against Kaggle from those raising a number of points as to why Kaggle isn’t worth doing, and why we should maybe not hold winners in such high esteem...

11 Jul 2021

XGBoost has many parameters that can be adjusted to achieve greater accuracy or generalisation for our models. Here we’ll look at just a few of the most common and influential parameters that we’ll need to pay most attention to....

20 Dec 2020

A big part of analysing our models post-training is working out whether the features we used for training actually helped in predicting the target, and by how much. Tree-based machine learning algorithms such as Random Forest and XGBoost come with a feature importance attribute that outputs an array containing a value between 0 and 1 for each feature representing how useful the mode....