The repo for mlbtradetrees.com. Analyze any trade in baseball history!

Last update: Nov 20, 2022

Related tags

Data Analysis BaseballTradeTrees

Overview

MLB Trade Trees

2.0.0 Release: November 24, 2021

www.mlbtradetrees.com allows you to view the trade tree of any player in MLB history.

What is a trade tree?

A trade tree will show you the complete details of a trade made by a team. Let's use Hall Of Fame candidate Cliff Lee for some examples, as he was traded multiple times throughout his career..

Here is the simplest form of his tree:

Cliff Lee was traded to the Mariners in 2009, and the Phillies received 3 players in return. All players the Phillies received in return either retired or became free agents, ending the tree with them.

Let's take a look at a more complicated example:

We can see the Mariners traded away Cliff Lee in 2010, receiving 4 players in return. 2 Players' lines end due to free agency and being picked up on waivers. 2 players' lines continue due to being traded away the next year. Some of those players' lines end however some continue to be traded away, so the tree grows. The tree finally ends in 2014 due to the final player hitting free agency.

Some of these trees can get pretty massive, spanning decades and dozens of trades. An example is Harry Simpson.

The Database

The transaction, team and player databases are thanks to Retrosheet. I will only update transactions when they update the database.

I have made some adjustments to the database that allows the search to go more smoothly:

Transaction database (data/sorted_transactions_final.csv)

Nan players involved in trades were changed to "PTBNL/Cash" (player to be named later). Most of the time you see this in a tree, it is a cash transaction.
Transactions of players that were released or granted free agency, then signed back with the team as their next transaction were deleted as it caused trees to end prematurely.
Franchise tags were added to the database to ensure that a team name change doesn't end a tree.

Team database (data/teams.csv)

All teams in the database received a franchise tag if they are part of the same franchise. They received a unique franchise code if they are an independant team.

Player database (data/teams.csv)

Nothing changed, just made a copy with the full name to easily get the user input. (static/css/searchable_players.csv)

Installing Locally

If you want to run the website locally:

install flask
install pandas
install JSGlue (allows Jinja to work in a js file)

Run server.py

What am I working on?

Updated Nov. 24 2021

Some players don't display properly due to having very old teams not listed in the teams database. Usually these are players before 1920. I just need to update the transactions database to find all teams without the franchise tag.
Adding stat support with pybaseball. I'd like to add total war contributed by players in a trade on the tree.
Searching for and filtering trees based on team, year, players in a tree, length of trees, etc.
Various UI enhancements, like clickable nodes to get a player's tree, collapsable nodes for easier readability.

The repo for mlbtradetrees.com. Analyze any trade in baseball history!

Related tags

Overview

MLB Trade Trees

2.0.0 Release: November 24, 2021

www.mlbtradetrees.com allows you to view the trade tree of any player in MLB history.

What is a trade tree?

The Database

Transaction database (data/sorted_transactions_final.csv)

Team database (data/teams.csv)

Player database (data/teams.csv)

Installing Locally

What am I working on?

Updated Nov. 24 2021

Owner

A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.

PySpark Structured Streaming ROS Kafka ApacheSpark Cassandra

This is a tool for speculation of ancestral allel, calculation of sfs and drawing its bar plot.

Data pipelines built with polars

MIR Cheatsheet - Survival Guidebook for MIR Researchers in the Lab

A crude Hy handle on Pandas library

PyEmits, a python package for easy manipulation in time-series data.

Weather Image Recognition - Python weather application using series of data

A data parser for the internal syncing data format used by Fog of World.

Functional tensors for probabilistic programming

Average time per match by division

A collection of robust and fast processing tools for parsing and analyzing web archive data.

CubingB is a timer/analyzer for speedsolving Rubik's cubes, with smart cube support

A Python module for clustering creators of social media content into networks

NumPy and Pandas interface to Big Data

PipeChain is a utility library for creating functional pipelines.

Lale is a Python library for semi-automated data science.

Python tools for querying and manipulating BIDS datasets.

Statistical & Probabilistic Analysis of Store Sales, University Survey, & Manufacturing data

Utilize data analytics skills to solve real-world business problems using Humana’s big data