Machine Learning : I did it, so can you too.

As promised, here’s my Python notebook which I used to generate my first set of predictions using machine learning. This is for passenger survival in the Titanic data set (Kaggle). The Titanic training data set contains 890 rows (Passengers), each with 12 columns including Name, Sex, Age, Ticket number, Passenger Class, Cabin name, Port of Embarkation, number of siblings. A relatively small data set but fun to play with as a novice without feeling overwhelmed! After some quick explorations of the data, I begin with some basic data munging. This is to prepare the data set for fitting...

Continue reading