In this post, I’ll walk through the steps starting with a raw dataset that’s organized differently from what I need, to reshaping the data using Excel formulas, and creating an interactive visualization using PowerBI, to publishing it online using PowerBI. Excel and PowerBI work hand-in-hand to solve such problems effectively, and efficiently. First, I’ll start […]
Category: Analytics
How to find the most occurring words in a document?
Every now and then I see in technical forums questions such as this: How do I find the most occurring word in my Word document? Or PDF document? Or Emails? etc. The question is a valid one if that’s something you’re curious about or need to find out in your professional role. In this blog, […]
Combining multiple datasets into a single data file (Excel or CSV)
The Scenario I want to analyze data for New England Patriots over the past 10 seasons. The data files are downloaded as a file by year. So, for 2010 through 2019 NFL season, I have 10 separate data files. I need to combine them all into a single file so I can do an all-up […]
Working with clipboard, dataframe, and data files
In this blog, I’ll share tips on: How to read data into a dataframe, create a subset of dataset in memory Read/write curated subset from/to clipboard as Text/CSV or XLSX Create new data files CSV/Excel The Scenario I have a large dataset. Checking its shape via Python, I see it has 18241 records with 9 […]
“God” game—revisited (aka Pop game)
What seems like yonks ago, I wrote a fun, DOS based visual game in college that’s a simple simulation of population growth with some user-inputs to the game (i.e. playing “God”, hence the title). You can read about it here. But here, I’m sharing info about that game, revisited, with a different twist. I rewrote […]
Data extraction with multiple criteria with interactive UI
Imagine we are business owners who provide a range of services. We have a large set of customer data but we really want to target those customers who might be interested in your services. We want to extract only that subset of data based on our services. And since we provide multiple, diverse services the […]
A Horse-race Simulation with a “Strip Plot”
Strip plots, or a dot plot horizontally spread out are very useful in certain cases. I’d use it to show spread or concentration of data points across multiple categories. For example, in order to visualize between brand A, and B, what are their product sales (or prices) and where they appear to focus. What the […]
Analyzing 10 Years of Patriots Performance: NFL
Love em or hate em, New England Patriots is a team to reckon with. They’ve implemented numerous creative and suspicious internal strategies that most other teams would love to copy or at least understand. Well, I cannot expose those details as they’re not available anywhere. What is available however is their scores and stats. So, […]
Tracking Actual vs. My Predictions 4/29/20
Back in March 2020, I did some predictive modeling using simple math and Excel where I presented 3 scenarios: Likely, Best-case, Worst-case. You can read the original blog here: What-If Models (COVID-19): Results Let’s take a look at today’s latest data from WHO and compare my Model 1: Likely Scenario numbers. We’ll compare the USA […]
COVID-19 Update 4-16-20
Another somber perspective of WHO’s official COVID-19 data released 4/16/2020: