I’ll start with a simple dataset and demonstrate how many critical questions can arise just from this. You can now imagine in a real-world situation, how many more complex questions can be asked and information gleaned! But if we can’t extract the key answers from this table, we won’t be able to tackle even the […]
Tag: STEM
Simple & brilliant way to extract multiple matches in one cell
The scenario: You have a dataset with values in one column that’s unique (or semi-unique) and another column with its associated values as Key:Value pairs but that column has repeating values. Imagine a list of members whose names are in one column and their corresponding membership status next to it as below: What we […]
How many real days…net? (Excel)
This is one of the simple and yet powerful tips when working with Dates in Excel. To calculate the number of workdays between two dates BUT EXCLUDING weekends (depending on your locale settings), we can use NETWORKDAYS(). In US and most western countries, Saturdays and Sundays would be excluded from the calculation. Without changing your […]
Phone number formatting tips (Excel and Python)
For most people working with data for a significant amount of time, formatting raw data from different sources are both a reality and a pain. Today, I show different ways to format an un-formatted string that contains phone numbers using Python, and Excel. Our task is to format it (either for human-reporting, or consumption by […]
Searching online (Google) with Python
In this blog, I demonstrate how to run a Google query from a Python app extremely easily. The harder part is just the set up and knowing what to install. So, let’s start with the set up details: You’ll need to install the following packages IN ORDER: 1) beautifulsoup4 2) google NOTE: The package to […]
Sample Size in Python
This is part of a 3-part series on the topic. Please read the posts in the order for maximum clarity and context: 1. Sample Size and Margin of Errors. Everything you need to know and ++ 2. Sample Size (Contd.) 3. Sample size in Python (This one) Ok, after reading the first 2 posts, you […]
Sample Size (Contd.)
This is part of a 3-part series on the topic. Please read the posts in the order for maximum clarity and context: 1. Sample Size and Margin of Errors. Everything you need to know and ++ 2. Sample Size (Contd.) 3. Sample Size in Python In this blog, we’ll use actual numbers to determine […]
Sample Size and Margin of Errors. Everything you need to know and ++
I’m not a statistician by profession or training. However, I find it fascinating with even the basics under my belt and find plethora of statistic’s practical usage. Without it, we’re really ignorant. With it, we’re equipped but not always best educated either. I heard the phrase again and again, “Correlation does not equal causation!” and […]
Running code from within another code (Python)
Most of us who have done any programming know that virtually all languages allow you to execute another program from within a program. For example, you start your program, which may start another process (via calling Execute or the like) that’s a self-contained program by itself, resulting in another process or spawn another thread as […]
Analyzing NFL Quarterbacks Stats
Data Science in sports is a huge deal! Even though most professional sports teams have been using them as a core part of their strategy, it was relatively recently that it got much more attention in the NFL…as soon as Seahawks dedicated a team to analytics! As they blew out records in virtually all positions […]