Sunday, September 21, 2025
Submit Press Release
Got Action
No Result
View All Result
  • Home
  • Football
  • Basketball
  • NCAA
    • NCAA Football
    • NCAA Basketball
    • NCAA Baseball
    • NCAA Sport
  • Baseball
  • NFL
  • NBA
  • NHL
  • MLB
  • Formula 1
  • MMA
  • Boxing
  • Tennis
  • Golf
  • Sports Picks
  • Home
  • Football
  • Basketball
  • NCAA
    • NCAA Football
    • NCAA Basketball
    • NCAA Baseball
    • NCAA Sport
  • Baseball
  • NFL
  • NBA
  • NHL
  • MLB
  • Formula 1
  • MMA
  • Boxing
  • Tennis
  • Golf
  • Sports Picks
Got Action
No Result
View All Result

Build an Environment for Sports Data Analysis

February 8, 2025
in NCAA Sport
0 0
0
Home NCAA Sport
Share on FacebookShare on Twitter


If you follow this blog, chances are that you’ve seen and perhaps even walked through my guide on building an environment for analysis. That article is from 5 years ago and I still get questions and feedback on it to this day. To be clear, I still think it’s a perfectly valid way to build an environment and to this day I still primarily use the Docker setup outlined in the guide. However, I find myself starting to gravitate more and more towards a non-Docker environment.

Docker is great and I still use it for many things, but lately I’ve found that it eats up a lot of resources on my local machine so I don’t always have it running. The base Docker image I shared in the previous article is still published and available for anyone to use, but it has been increasingly challenging to maintain and keep up-to-date via automation. You can still use that image and it still works great in my experience, but recently gained appreciation for a more lightweight approach.

These are the tools used in this approach:

VS Code
Jupyter
Python (with virtual environments)
The CFBD and CBBD Python packages

If you’ve never used VS Code before as an IDE, you should be checking it out. It’s long been my IDE of choice for everything else and it provides a fantastic experience for working with Jupyter notebooks. What has put it over the top for me and caused my to use it more and more for data analytics task is GitHub Copilot. GitHub Copilot has become something that I am no longer able to live without. You may be familiar with my recent rewrite of the CFBD API, website, and most associated infrastructure. You may also be familiar with my recent foray into basketball with CollegeBasketballData.com. I wouldn’t have been able to do any of this without Copilot. It’s probably at least halved my development time on the above. And it works seamlessly with Jupyter notebooks in VS Code.

Just as with the previous guide, this guide should work whether you are on Windows, Mac, or Linux. I am a Windows user and still highly recommend setting up Windows Subsystem for Linux (WSL) with your favorite Linux flavor (I use Ubuntu) if you are also in Windows. I do all my development (personal and professional) exclusively in WSL.

Prerequisites are that you have the following installed:

You will also need some VS Code extensions, at the very least the Python and Jupyter extensions. Here is the list of extensions I am running for this tutorial:

Open up a terminal window. Let’s create a directory called jupyter and move into that directory.

mkdir jupyter
cd jupyter

Next, we’re going to create a Python virtual environment. This is always a good practice as allows you to work with different Python versions and package versions across different folders/repos.

python -m venv ./venv

This should have created a venv folder with the Python binaries and some scripts. We are going to activate the virtual environment we just created by running:

source ./venv/bin/activate

Note that this command may differ for Mac and non-WSL Windows. Refer to the documentation linked above for instructions specific to those OSes.

Next we will install a list of commonly used Python packages. Feel free to add any others you may need. We will also write these packages into a requirements.txt file for easy installation.

pip install cbbd cfbd ipykernel matplotlib numpy pandas scikit-learn xgboost
pip freeze > requirements.txt

Let’s create an empty Jupyter notebook and open this directory in VS Code.

touch test.ipynb
code .

Inside VS Code, open the test.ipynb file from the left sidebar. Then, click on “Select Kernel” in the top-right and then “Python Environments…” from the dropdown list that appears.

Select the environment labeled venv. There should be a star next to it.

Now we can begin working in the Jupyter notebook. Let’s start by importing the cfbd and pandas packages and running the code block.

import cfbd
import pandas as pd

If you didn’t install the ipykernel package with the list of packages above, you may be greeted with the below prompt. Just click ‘Install’ and wait.

Next, let’s configure the CFBD package with our CFBD API key. If you do not have a key, you can acquire one from the website. Replace the text below with your personal key.

configuration = cfbd.Configuration(
access_token = ‘your_key_here’
)

We can now call the API to grab a list of games:

with cfbd.ApiClient(configuration) as api_client:
games_api = cfbd.GamesApi(api_client)

games = games_api.get_games(year=2024, classification=’fbs’)

len(games)

In my example, there were 920 games returned. It’s pretty easy to load those into a Pandas DataFrame.

df = pd.DataFrame.from_records([g.to_dict() for g in games])
df.head()

One neat trick using the Python library is that every method has a special version that will also include the HTTP response metadata. Simply attach _with_http_info to the end of the method. You can use this to keep track of how many monthly calls you have remaining.

with cfbd.ApiClient(configuration) as api_client:
games_api = cfbd.GamesApi(api_client)

response = games_api.get_games_with_http_info(year=2024, classification=’fbs’)

response.headers[‘X-CallLimit-Remaining’]

And then access the same as before data via the response.data field.

games = response.data
df = pd.DataFrame.from_records([g.to_dict() for g in games])
df.head()

And that is all there is to it!

I do still love Docker for many things and think it is still perfectly adequate to use for a data analytics environment. However, you can see how this approach is much more lightweight and allows you to leverage the full capabilities of VS Code. We didn’t really dig into the GitHub Copilot extension. If you didn’t install, then I cannot recommend it enough as it is a gamechanger.

Some other tweaks that people make include swapping out pip for conda. However, I have found the above setup to be more than adequate. Anyway, happy coding!



Source link

Tags: AnalysisBuildDataEnvironmentSports
Previous Post

Interdonato looking for improvement in Year 2 at BC

Next Post

UFC 312 Recap: Knockouts, Submissions, and Dominating Decisions

Related Posts

CFB Week 4 winners, losers: Indiana, Texas Tech score wins
NCAA Sport

CFB Week 4 winners, losers: Indiana, Texas Tech score wins

September 21, 2025
Jackson Arnold doesn’t hold back on loss to Oklahoma
NCAA Sport

Jackson Arnold doesn’t hold back on loss to Oklahoma

September 21, 2025
Examining Mike Gundy’s contract buyout following Oklahoma State’s loss to Tulsa
NCAA Sport

Examining Mike Gundy’s contract buyout following Oklahoma State’s loss to Tulsa

September 20, 2025
How to watch the Florida vs. Miami NCAA college football game: Kickoff time, channel, where to stream and more
NCAA Sport

How to watch the Florida vs. Miami NCAA college football game: Kickoff time, channel, where to stream and more

September 20, 2025
Three CFB teams that are on upset alert in Week 4
NCAA Sport

Three CFB teams that are on upset alert in Week 4

September 20, 2025
Wildcats QB commit making waves in senior season
NCAA Sport

Wildcats QB commit making waves in senior season

September 19, 2025
Next Post
UFC 312 Recap: Knockouts, Submissions, and Dominating Decisions

UFC 312 Recap: Knockouts, Submissions, and Dominating Decisions

Super Bowl 59 prop bet mega-post! – Cole’s Gameday Blog

Super Bowl 59 prop bet mega-post! – Cole's Gameday Blog

Leave a Reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
Away Attendances at the Weekend – 8th September 2025

Away Attendances at the Weekend – 8th September 2025

September 8, 2025
2025 European F1 Camping Guide

2025 European F1 Camping Guide

March 14, 2025
Bills QB Josh Allen to have college jersey retired at Wyoming

Bills QB Josh Allen to have college jersey retired at Wyoming

September 6, 2025
Tua Tagovailoa says relationship with Tyreek Hill ‘good’

Tua Tagovailoa says relationship with Tyreek Hill ‘good’

September 4, 2025
Questions to Ask Interested College Coaches and Recruiters

Questions to Ask Interested College Coaches and Recruiters

August 26, 2025
BYU Shuts Out Houston, Advances in 2025 Big 12 Softball Tournament

BYU Shuts Out Houston, Advances in 2025 Big 12 Softball Tournament

May 8, 2025
Anthony Davis could return to Mavericks’ lineup during upcoming Eastern road trip: Report

Anthony Davis could return to Mavericks’ lineup during upcoming Eastern road trip: Report

94
Getting with the programme | Arseblog … an Arsenal blog

Getting with the programme | Arseblog … an Arsenal blog

2
The longest an NCAA bracket has ever stayed perfect

The longest an NCAA bracket has ever stayed perfect

1
Hanshin Tigers shut out Cubs, Dodgers for back-to-back wins

Hanshin Tigers shut out Cubs, Dodgers for back-to-back wins

1
Man United Back On Winning Ways as They Shock Chelsea | Football news at 1000Goals.com: Football Betting, Highlights, and More

Man United Back On Winning Ways as They Shock Chelsea | Football news at 1000Goals.com: Football Betting, Highlights, and More

0
Cedric Coward 100 perc Cleared For Grizzlies Training Camp

Cedric Coward 100 perc Cleared For Grizzlies Training Camp

0
Team World Takes Commanding Lead into Sunday at Laver Cup – Tennis Now

Team World Takes Commanding Lead into Sunday at Laver Cup – Tennis Now

September 21, 2025
Minnesota Twins place Pablo Lopez on IL, ending his season

Minnesota Twins place Pablo Lopez on IL, ending his season

September 21, 2025
Oscar Collazo Retains Title After Jayson Vayson’s Corner Mysteriously Halts Competitive Bout – Boxing Results

Oscar Collazo Retains Title After Jayson Vayson’s Corner Mysteriously Halts Competitive Bout – Boxing Results

September 21, 2025
CFB Week 4 winners, losers: Indiana, Texas Tech score wins

CFB Week 4 winners, losers: Indiana, Texas Tech score wins

September 21, 2025
Misery Index: Arkansas’ embarrassing slide continues, and the Hogs are running out of excuses

Misery Index: Arkansas’ embarrassing slide continues, and the Hogs are running out of excuses

September 21, 2025
Blackhawks settle second lawsuit in Aldrich sexual assault case

Blackhawks settle second lawsuit in Aldrich sexual assault case

September 21, 2025
Facebook Twitter Instagram LinkedIn TikTok Pinterest
Got Action

Stay updated with the latest sports news, highlights, and expert analysis at Got Action. From football to basketball, we cover all your favorite sports. Get your daily dose of action now!

CATEGORIES

  • Baseball
  • Basketball
  • Boxing
  • Football
  • Formula 1
  • Golf
  • MLB
  • MMA
  • NBA
  • NCAA Baseball
  • NCAA Basketball
  • NCAA Football
  • NCAA Sport
  • NFL
  • NHL
  • Tennis
  • Uncategorized

SITEMAP

  • About us
  • Advertise with us
  • Submit Press Release
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2025 Got Action.
Got Action is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Football
  • Basketball
  • NCAA
    • NCAA Football
    • NCAA Basketball
    • NCAA Baseball
    • NCAA Sport
  • Baseball
  • NFL
  • NBA
  • NHL
  • MLB
  • Formula 1
  • MMA
  • Boxing
  • Tennis
  • Golf
  • Sports Picks
Submit Press Release

Copyright © 2025 Got Action.
Got Action is not responsible for the content of external sites.