One way of classifying texts into categories is to assign to a new given text the category of the most similar text in the training set. Given vector representations of the texts, there are many similarity metrics that can be chosen, however, some of them can yield the exact same accuracy.
This is a very short and sweet post on how to calculate the median for different subgroups of data that you want to aggregate like when using GROUP BY
Let’s say you hear that to become a world-star basketball player, you need to score at least 90% of your free throws. So you train for a day, a week, a month, a year… and you consider yourself ready. Now it is time to test. But… how many free throws do you need to shoot to accurately measure your free throw accuracy?
Last Monday, I started working @ Opinary (Berlin) as a Data Scientist. Wednesday, I was introduced to MySQL / BigQuery window functions. They’re likely to be one of the most useful (and one of my favourite) functions from now on. Let me very quickly describe what window functions are.
Have you ever wondered how to locate Wally with Machine Learning?
This is the second of a series of posts on BigData software and architecture. The main goal of this series is to not reinvent the wheel by re-developing a system that is already out there. In this post I’ll take a look at Pig, a high level language that can be compiled into MapReduce jobs.
This is the first of a series of posts on BigData software and architecture. The main goal of this series is to not reinvent the wheel by re-developing a system that is already out there. A more intelligent approach always takes into account the existing tools and either re-uses them or builds on top of them if necessary. In the unlikely case that your personal/company’s problem is different to anything out there already and you believe you need to develop it from scratch, it will always help to know which problems you will have to face in the near future.
This is the first of a series of paper summaries on Machine Learning
Disclaimer: This assumes you’ve read Docker 101 or that you know Docker basics.
Disclaimer: This is not an in-depth introduction to Docker and only aims at pointing out its utility and ease of use.
I will use this website as a blog to talk about things I learn about or that I found interesting enough to share with the Internet.