Machine Learning : I did it, so can you too.

As promised, here’s my Python notebook which I used to generate my first set of predictions using machine learning. This is for passenger survival in the Titanic data set (Kaggle). The Titanic training data set contains 890 rows (Passengers), each with 12 columns including Name, Sex, Age, Ticket number, Passenger Class, Cabin name, Port of Embarkation, number of siblings. A relatively small data set but fun to play with as a novice without feeling overwhelmed! After some quick explorations of the data, I begin with some basic data munging. This is to prepare the data set for fitting...

Continue reading

alphago

Data Science: My Journey from Doctor to Noob Kaggler.

In the last couple of months, I’ve been pursuing another hobby  – data science and artificial intelligence (AI). You might have followed the much-publicised match between Lee Sedol, one of the world’s best Go players, and AlphaGo, an AI system developed by Google that spurred discussions about the future of AI. (Spoiler : AlphaGo won 4 – 1) This has been partly triggered by my work at Holmusk, where our brilliant data science team has been winning multiple competitions and most recently took part in the Second Annual Data Science Bowl on Kaggle. This was a challenge...

Continue reading

Big Data and Chronic Diseases at Asia IoT Business Platform (4 Nov 2015)

Two weeks ago, I was invited to speak on the use of big data to fight chronic diseases at the Asia IoT Business Platform, held in Kuala Lumpur on 3 – 4 Nov 2015. Although I was only able to attend on the 2nd day, it was an interesting gathering of technologists and opportunists who are all interested in the potential for M2M and IoT to change the world. IoT = Internet of Things. It was my privilege to be able to share my personal viewpoints on how data science is going to transform medicine in the near future with an audience of over 200+ people, as well as connect with many new...

Continue reading

Discussing Genomics, Big Data and Medicine

For anyone who has an inkling of interest in the conversations around genomics, big data and medicine, I highly recommend this series of talks that are available publicly online, by the Icahn School of Medicine In particular, this was a great session by Tim Kelsey, who is the national director for patients and information in the NHS.  He touched upon many of the thorny issues we have to begin conversations about, as we move towards personalised medicine. If you don’t have time to watch it through (about an hour long), I’ve got some of the key points noted down: The UK has...

Continue reading

Machine Learning and Neural Networks

Because of the work I am involved in, I’ve been trying to get a better grasp around machine learning and artificial intelligence. The belief is that data science is going to do more to improve healthcare than systems biology in the next decade. And I’m a believer. Here’s a fun video I found that introduces neural networks – especially if you’re old enough to remember Mario!  

Continue reading