Trees: Self-Balancing Trees
Trees: Self-Balancing Trees
Self-balancing trees are marvels of computer science. Take that, apple trees. When implemented correctly, self-balancing trees balance themselves as they’re created, never needing any outside interference to look nice and balanced. All you have to do is set up that nifty, self-balancing structure when you first start adding data to your tree. Then you can sit back, relax, and re-watch Back to the Future. Your tree will take care of itself.
(Bet you've never seen an apple tree juggling its branches and apples like that.)
We could try to explain how it all works, but it’s really best to observe these trees in their native habitat. Keep quiet, don’t make any sudden moves, and maybe we’ll get to see a self-balancing BST in action.
You know, we don’t really like waiting. Let’s just make one.
We’ll use the numbers 34, 22, 17, 13, 37, 39, and 25 as nodes. At every step, the tree keeps track of each node’s subtree height, just like we did in the Balancing section. Once a node has a difference in its two subtrees of at least two (02 or 20), the tree's going to readjust and rebalance itself.
First comes the root node, 34.
data:image/s3,"s3://crabby-images/224f5/224f5be3134a467f3a07f27100a64b5b94dbc5cc" alt="A single node of 34."
If we throw 22 in, it'll become 34's left child. Because…22 is less than 34.
data:image/s3,"s3://crabby-images/98f4c/98f4c51a49b1e2f8aadde6f146bfcbc8b33cff37" alt="34, marked as 10, with a left child 22, marked with a 00."
Next we'll throw in 17, which ends up as the left child of 22.
data:image/s3,"s3://crabby-images/aece9/aece93609aa7c04aedcf773d2f7db72f2806b75e" alt="The same tree as before, but 17 is now the left child of 22. 17's marked with a 00, 22 is marked with 10, and 34's marked with a 20."
Uh oh. This tree has a problem. The tree's now unbalanced, which means it has to do something.
All you have to do is press the unpause button on your movie; the tree's going to do all the work by rotating its branches to rebalance itself. After the rotation's done, the tree will double-check to make sure everything is balanced.
The last node added (17) was the left-left descendant of the node with the super-unbalanced score (34). Since everything's to the left of the root, we know that logically the root could be a right child of either one. But because of the attachment between 22 and 17, we also know that it can't be 17's right child, since being 17's right child would mean being 22's left grandchild. 34's greater than 22, so it has to become 22's right child. It'll do something like this:
data:image/s3,"s3://crabby-images/49ca9/49ca9862b322a430c52f0003f292c36597caddb1" alt="34 with its left arrow grayed out and a red arrow pointing to its movement down to the right of 22. Now 22 is the root."
Boom. Perfectly balanced.
data:image/s3,"s3://crabby-images/3edb0/3edb0d5b2e394c16cb014b92e8d7b0db4738ee37" alt="22 is the root, with a left child of 17 and a right child of 34. All three nodes have a rating of 00."
To make things interesting, let's throw in a nice, unlucky 13.
data:image/s3,"s3://crabby-images/1c783/1c783f05d205fed149411990d7a32b3a63782eb1" alt="13's now the left child of 17, giving it the code 10. 22 also now has a code of 10."
So far so good. Next comes 37, which goes into 22's right subtree as 34's right child.
data:image/s3,"s3://crabby-images/db682/db68230f788819d3d762b5e6892eb83180be0d21" alt="Both 37 and 13 have the code 00, 17 and 34 have a code of 10 and 01, respectively. Plus side: now 22 has a code of 00."
Here's where things get interesting. If we throw in 39 to the right of 37, we'll get some serious balancing issues.
[gulp]
data:image/s3,"s3://crabby-images/6c3ad/6c3addc4f9859f0597fcf168c3cd952c949a1a96" alt="39 is now the right child of 37, giving 37 the code 01. 34, its parent now has a code of 02."
Here's where things get complicated. A subtree on the right is unbalanced, meaning that the tree's going to spring into action. We'll basically do the same thing as before, rotating that 02 node (34) so that it's a child of the 01 node (37). If you implement this type of tree, you'll need to be very careful about how nodes point to each other so that you don't lose anything when you juggle them.
data:image/s3,"s3://crabby-images/99142/99142897715abdfd3e110c2cb8a8c3653e7c4ff3" alt="34's arrow towards 37 is grayed out, with a red arrow showing its movement down to become the left child of 37."
Just like that, we're balanced again. Only one node left.
data:image/s3,"s3://crabby-images/fd51a/fd51a1173294c65e7175ca7b5760d782c57f5fdd" alt="The balanced tree. Now the only node <em>not</em> 00 is 17, which only has a left child of 13. That makes its code 10.."
Let's throw in 25 just for good luck.
data:image/s3,"s3://crabby-images/2b379/2b379b3ad4be116a40d4152fbb50a96f82d6260f" alt="25 shows up as the left child of 34, making 34's, 37's and 22's codes all 01."
Phew! No more rebalancing needed. The tree's finished. Notice how it isn't the optimal shape? If we're really splitting hairs, the best possible tree would have a height of 2 instead of 3, but self-balancing trees are built to have random values added to them at any point. Taking the time to make this tree perfect would become useless the minute we add a new value. It's much more sustainable to just keep it from getting a little too unbalanced.
Basically balanced is good enough.
How Things Can Become Unbalanced
There are four different ways a node can unbalance a tree:
- left-left
- right-right
- left-right
- right-left
We've seen the first two, and they're usually the easiest, but any self-respecting tree (or its programmer) can also handle the left-right and right-left cases.
Say we've got this right-left situation. 58 was the most recently added node:
data:image/s3,"s3://crabby-images/971bb/971bb91ad6edd2815ff679662380e6a8313cd6af" alt="45 is the root, with a code of 02. 69, its right child, has a code of 10 because 58, 69's child is on the left."
The easiest thing to do is make this look like a right-right case instead of a hairy right-left. To do that, we'll just switch 69 and 58 so that 58 is the parent.
data:image/s3,"s3://crabby-images/a5406/a540669f62ed9ff2c2df844caa6b1ec92b7d580d" alt="The tree from before, except the arrow down to 58 is grayed out with a red arrow showing its movement up to become the parent of 69."
Now that we're in a state we've seen before, we can just treat it like that previous case. that means turning 45 into a child of the new root: 58.
data:image/s3,"s3://crabby-images/d32d1/d32d1bd0649c1474802ea416f523427a4ecbfba1" alt="45's arrow down to 58 is grayed out with a red arrow showing its movement down to become the left child of the new root: 58."
Left-right is going to follow a similar formula, but mirrored. We have a root of 35, with a left child: 19. 19's right child, 21, was just added in.
data:image/s3,"s3://crabby-images/6ca9d/6ca9d7d87cb584d38662281edf9eec7e5d2bfadf" alt="35 is the root with a code of 20. 19, its left child, has a code of 01 because 19's right child is 21."
The easiest fix is to turn this into a left-left problem by making 21 the parent of 19. That way, we'll have a situation we already know how to deal with.
data:image/s3,"s3://crabby-images/ef759/ef759e8f7ccb48cd1494ecf68797651146848748" alt="The arrow from 19 down to 21 is grayed out with a red arrow showing 21's movement to become the parent of 19."
Now we're free to apply that left-left correction, making 21 the new root and 35 its right child.
data:image/s3,"s3://crabby-images/4e275/4e275fae84afe8bc112afe46eb9e406fa6bf4f89" alt="35's arrow down to 21 is grayed out with a red arrow showing its movement to become 21's right child."
Done.
Self-balancing makes things convenient, but it can also cost a lot of time to maintain through the life of the data structure. Between balancing and checking to make sure it's balanced, depending on how big your tree gets and how often you need to find different nodes, it might not be the most efficient way to go.
You've been warned.