Wrangling MLB Pitchf/x Data with Python
.MP4 | Video: 1280×720, 30 fps(r) | Audio: AAC, 48000 Hz, 2ch | 1.28 GB
Duration: 2.5 hours | Genre: eLearning | Language: English
Charting MLB GameDay – PitchF/x data using Jupyter Notebooks, Python and MatDescriptionLib
What you’ll learn
How to find MLB game and pitch data in Gameday.
How to create and program a Jupyter Notebook in Python.
How to extract XML pitch data from the MLB website.
How to coerce XML tree data into a Pandas Dataframe.
How to extract Dataframe slices into multiple views.
How to Description pitch data with MatDescriptionlib and PyDescription graphs.
Adding data columns to a Pandas Dataframe.
Descriptionting pitch tendency as pie charts (by ball-strike count).
Basic programming is helpful.
In the 2006 playoffs, Major League Baseball debuted a pitch tracking camera system called PitchF/x. Now installed in every MLB stadium, the system has been continually extended and re-branded. From cameras to TrackMan radar, from StatCast, to GameDay – MLB now tracks every pitch and every player’s movement on each pitch. The data are made public on the MLB web site and SaberMetricians world-wide pour over every detail. The teams themselves, average five or more statisticians dedicated to analyzing the data to aid in selecting and improving players.
I’m Chaz Henry – a software engineer, 12 year little league coach and founder of the PowerChalk dot com website. In this class, we’re going to open a fresh Jupyter Notebook, grab the MLB game data from Clayton Kershaw’s 2014 no-hitter and wrangle that data in Python. It’s an introduction in SaberMetrics – the empirical study of baseball statistics.
We’ll use built-in Python libraries and graph the pitches with MatDescriptionLib and PyDescription. Along the way we’ll talk about best practices for Jupyter Notebook, Python coding, XML parsing and maybe a little baseball.
So, if you’re a coder, a SaberMetrician or a just a baseball fan who wants to peek behind the curtain at what’s driving MoneyBall and the next wave of player development, sign up for the course and let’s start scrubbing the pitch data from one of the greatest pitching performances in MLB history.
Who this course is for:
Beginner or intermediate Python programmers.
SaberMetric baseball fans.
(Buy premium account for maximum speed and resuming ability)