I’m not a statistician by profession or training. However, I find it fascinating with even the basics under my belt and find plethora of statistic’s practical usage. Without it, we’re really ignorant. With it, we’re equipped but not always best educated either. I heard the phrase again and again, “Correlation does not equal causation!” and […]
Tag: analytics
Analyzing NFL Quarterbacks Stats
Data Science in sports is a huge deal! Even though most professional sports teams have been using them as a core part of their strategy, it was relatively recently that it got much more attention in the NFL…as soon as Seahawks dedicated a team to analytics! As they blew out records in virtually all positions […]
Peek into Infographics
From newspapers and magazines, online and offline, Infographic is the rage. They convey numbers in an attractive, easy to understand visuals for most people. There are everywhere! In this blog, I’ll offer some practical advice, tips, and resources. First, the caveats… While Infographic can be a lot of fun to make, avoid these pitfalls and […]
Analyzing Age, Height, Weight, BMI connection
I pulled down some real data from CDC (https://wwwn.cdc.gov/) which are relatively recent and contains data on 2 year cycles. The data cover people’s age, height, and BMIs (broken up by genders). I wanted to see how age and height correlated, how height and weight correlated, and how BMIs vary by age for both male […]
How Many Dogs In the Park—in the Past and Future?
Here’s the Scenario: Based on our observations over a month, we find that there are 40 dogs on average in a off-leash park during an hour’s time-frame during the day in its open hours. We want to find out the chance of having a specific number of dogs at any given time in the park […]
Getting to know our presidents
Let’s gather and organize some facts about all the U.S. presidents to-date and slice and dice the data to see what insights we can garner. First, I looked at various, reliable sources and shaped the data in a way I could work with. For example, normalizing the data, converting to appropriate scales and units, bucketing, […]
Need a baby name? Or just love data?
In either case, read on. I collected a dataset of poplular baby names from OpenData government site of City of New York ranging from 2011 through 2016…exactly 19,418 records. Original dataset view: What I want to do is find out: a) Most popular names b) Less popular names (or rare names used) c) Slice it […]
How well are your incentives working?
Here’s the situation. You’re asking customers to sign up for your gym membership by offering some discounts. But how well are they working? Let’s find out (working with limited data). Information on hand is that you made 20 attempts or trials and find that 22% signed up. Your task is to calculate the various chances […]
Solving Real-world crimes with Bayes Theorem
Today, I present a real-life problem that actually goes back many years in terms of the type of problem and method to tackle such. Here’s the scenario: There was a robbery and the software has identified the perpetrator to be black. The case goes to jury and they must decide with the help of the […]
Anatomy of a goal (part 2)
This is a continuation of the original soccer goal tracking chart and animation posted @: https://flyingsalmon.net/blog/?p=869 It’s recommended that you read that blog first for context. In this blog, I added actual players involved in that goal (denoted by their jersey and individual numbers). The players data are added as an addition series (much like […]