Every now and then I see in technical forums questions such as this: How do I find the most occurring word in my Word document? Or PDF document? Or Emails? etc. The question is a valid one if that’s something you’re curious about or need to find out in your professional role. In this blog, […]
Tag: datascience
Turning visual map results into a database/dataset
I often get questions such as: “How do I export Google business data into a database?” or “How do I import Google locations data into an Excel sheet?” They’re both pointing to the same need. The scenario is that we can search on Google Maps or Bing Maps or just on any online Search engine […]
Extracting phone numbers from any document/file
Imagine you have documents of various types: email, Word documents, PDF, text, HTML, Excel, etc. And you want to extract phone numbers, and phone numbers only from those documents as you try to build a list of contact information or a database. In this post, I show a quick and effective way to do just […]
Combining multiple datasets into a single data file (Excel or CSV)
The Scenario I want to analyze data for New England Patriots over the past 10 seasons. The data files are downloaded as a file by year. So, for 2010 through 2019 NFL season, I have 10 separate data files. I need to combine them all into a single file so I can do an all-up […]
Working with clipboard, dataframe, and data files
In this blog, I’ll share tips on: How to read data into a dataframe, create a subset of dataset in memory Read/write curated subset from/to clipboard as Text/CSV or XLSX Create new data files CSV/Excel The Scenario I have a large dataset. Checking its shape via Python, I see it has 18241 records with 9 […]
Using Python, SQL, and Excel together for data shaping & analysis
There are times when I’d like to take a peek at my data CSV or XLS/X even before firing up Excel. Usually when the dataset is large and I want to quickly inspect it and modify it for my needs before even putting Excel to work. For that purpose, Python is very useful. Add its […]
Analyzing 10 Years of Patriots Performance: NFL
Love em or hate em, New England Patriots is a team to reckon with. They’ve implemented numerous creative and suspicious internal strategies that most other teams would love to copy or at least understand. Well, I cannot expose those details as they’re not available anywhere. What is available however is their scores and stats. So, […]
Tracking Actual vs. My Predictions 4/29/20
Back in March 2020, I did some predictive modeling using simple math and Excel where I presented 3 scenarios: Likely, Best-case, Worst-case. You can read the original blog here: What-If Models (COVID-19): Results Let’s take a look at today’s latest data from WHO and compare my Model 1: Likely Scenario numbers. We’ll compare the USA […]
COVID-19 Update 4-16-20
Another somber perspective of WHO’s official COVID-19 data released 4/16/2020:
COVID-19 Update 4-14-20
Showing Corona virus cases and deaths across selected countries for March 1st, 2020 to April 14th, 2020.