Splay tree

Splay tree

A splay tree is a self-balancing binary search tree with the additional property that recently accessed elements are quick to access again. It performs basic operations such as insertion, look-up and removal in O(log(n)) amortized time. For many non-uniform sequences of operations, splay trees perform better than other search trees, even when the specific pattern of the sequence is unknown. The splay tree was invented by Daniel Sleator and Robert Tarjan.

All normal operations on a binary search tree are combined with one basic operation, called "splaying". Splaying the tree for a certain element rearranges the tree so that the element is placed at the root of the tree. One way to do this is to first perform a standard binary tree search for the element in question, and then use tree rotations in a specific fashion to bring the element to the top. Alternatively, a top-down algorithm can combine the search and the tree reorganization into a single phase.

Advantages and disadvantages

Good performance for a splay tree depends on the fact that it is self-balancing, and indeed self optimizing, in that frequently accessed nodes will move nearer to the root where they can be accessed more quickly. This is an advantage for nearly all practical applications, and is particularly useful for implementing caches and garbage collection algorithms; however it is important to note that for uniform access, a splay tree's performance will be considerably (although not asymptotically) worse than a somewhat balanced simple binary search tree.

Splay trees also have the advantage of being considerably simpler to implement than other self-balancing binary search trees, such as red-black trees or AVL trees, while their average-case performance is just as efficient. Also, splay trees do not need to store any bookkeeping data, thus minimizing memory requirements. However, these other data structures provide worst-case time guarantees, and can be more efficient in practice for uniform access.

One worst case issue with the basic splay tree algorithm is that of sequentially accessing all the elements of the tree in the sorted order. This leaves the tree completely unbalanced (this takes n accesses - each a O(log "n") operation). Reaccessing the first item triggers an operation that takes O(n) operations to rebalance the tree before returning the first item. This is a significant delay for that final operation, although the amortized performance over the entire sequence is actually O(log "n"). However, recent research shows that randomly rebalancing the tree can avoid this unbalancing effect and give similar performance to the other self-balancing algorithms.Fact|date=February 2008

It is possible to create a persistent version of splay trees which allows access to both the previous and new versions after an update. This requires amortized O(log "n") space per update.

Contrary to other types of self balancing trees, splay trees work well with nodes containing identical keys. Even with identical keys, performance remains amortized O(log "n"). All tree operations preserve the order of the identical nodes within the tree, which is a property similar to stable sorting algorithms. A carefully designed find operation can return the left most or right most node of a given key.

The splay operation

When a node "x" is accessed, a splay operation is performed on "x" to move it to the root. To perform a splay operation we carry out a sequence of "splay steps", each of which moves "x" closer to the root. By performing a splay operation on the node of interest after every access, the recently accessed nodes are kept near the root and the tree remains roughly balanced, so that we achieve the desired amortized time bounds.

Each particular step depends on three factors:
* Whether "x" is the left or right child of its parent node, "p",
* whether "p" is the root or not, and if not
* whether "p" is the left or right child of "its" parent, g (the "grandparent" of x).

The three types of splay steps are:

Zig Step: This step is done when "p" is the root. The tree is rotated on the edge between "x" and "p". Zig steps exist to deal with the parity issue and will be done only as the last step in a splay operation and only when "x" has odd depth at the beginning of the operation.

Zig-zig Step: This step is done when "p" is not the root and "x" and "p" are either both right children or are both left children. The picture below shows the case where "x" and "p" are both left children. The tree is rotated on the edge joining "p" with "its" parent "g", then rotated on the edge joining "x" with "p". Note that zig-zig steps are the only thing that differentiate splay trees from the "rotate to root" method introduced by Allen and Munro prior to the introduction of splay trees.

Zig-zag Step: This step is done when "p" is not the root and "x" is a right child and "p" is a left child or vice versa. The tree is rotated on the edge between "x" and "p", then rotated on the edge between "x" and its new parent "g".

Performance theorems

There are several theorems and conjectures regarding the worst-case runtime for performing a sequence S of "m" accesses in a splay tree containing "n" elements.

;Balance TheoremCitation | first1=Daniel D. | last1=Sleator | first2=Robert E. | last2=Tarjan | title=Self-Adjusting Binary Search Trees | url=http://www.cs.cmu.edu/~sleator/papers/self-adjusting.pdf|journal=Journal of the ACM | volume=32 | issue=3 | pages=pp. 652-686 | year= 1985] : The cost of performing the sequence "S" is O(m(log n + 1)+nlog n),!. In other words, splay trees perform as well as static balanced binary search trees on sequences of at least "n" accesses.;Static Optimality Theorem: Let q_i be the number of times element "i" is accessed in "S". The cost of performing "S" is OBigl(m+sum_{i=1}^n q_ilogfrac{m}{q_i}Bigr). In other words, splay trees perform as well as optimum static binary search trees on sequences of at least "n" accesses.;Static Finger Theorem: Let i_j be the element accessed in the j^{th} access of "S" and let "f" be any fixed element (the finger). The cost of performing "S" is OBigl(m+nlog n+sum_{j=1}^m log(|i_j-f|+1)Bigr).;Working Set Theorem: Let t(j) be the number of distinct elements accessed between access "j" and the previous time element i_j was accessed. The cost of performing "S" is OBigl(m+nlog n+sum_{j=1}^m log(t(j)+1)Bigr).;Dynamic Finger TheoremR. Cole, B. Mishra, J. Schmidt, A. Siegel. "On the Dynamic Finger Conjecture for Splay Trees. Part I: Splay Sorting log n-Block Sequences". SIAM Journal on Computing 30, pages 1-43, 2000.] R. Cole. "On the Dynamic Finger Conjecture for Splay Trees. Part II: The Proof". SIAM Journal on Computing 30, pages 44-85, 2000.] : The cost of performing "S" is OBigl(m+n+sum_{j=1}^m log(|i_{j+1}-i_j|+1)Bigr).;Scanning TheoremR.E. Tarjan. "Sequential access in splay trees takes linear time". Combinatorica 5, pages 367-378, 1985.] : Also known as the Sequential Access Theorem. Accessing the "n" elements of a splay tree in symmetric order takes Theta(n),! time, regardless of the initial structure of the splay tree. The tightest upper bound proven so far is 4.5n.Amr Elmasry. "On the sequential access theorem and deque conjecture for splay trees". Theor. Comput. Sci. 314(3), pages 459-466, 2004.]

Dynamic optimality conjecture

In addition to the proven performance guarantees for splay trees there is an unproven conjecture of great interest from the original Sleator and Tarjan paper. This conjecture is known as the "dynamic optimality conjecture" and it basically claims that splay trees perform as well as any other binary search tree algorithm up to a constant factor.

:Dynamic Optimality Conjecture: Let A be any binary search tree algorithm that accesses an element x by traversing the path from the root to x at a cost of d(x)+1, and that between accesses can make any rotations in the tree at a cost of 1 per rotation. Let A(S) be the cost for A to perform the sequence S of accesses. Then the cost for a splay tree to perform the same accesses is O(n + A(S)).

There are several corollaries of the dynamic optimality conjecture that remain unproven:

:Traversal Conjecture: Let T_1 and T_2 be two splay trees containing the same elements. Let S be the sequence obtained by visiting the elements in T_2 in preorder ("i.e." depth first search order). The total cost of performing the sequence S of accesses on T_1 is O(n).

:Deque Conjecture:S. Pettie. "Splay Trees, Davenport-Schinzel Sequences, and the Deque Conjecture". In Proceedings 19th ACM-SIAM Symposium on Discrete Algorithms, pages 1115--1124, 2008.] R. Sundar. "On the deque conjecture for the splay algorithm". Combinatorica 12(1):95--124, 1992.] Let S be a sequence of m double-ended queue operations (push, pop, inject, eject). Then the cost of performing S on a splay tree is O(m + n).

:Split Conjecture:J. Lucas. "On the Competitiveness of Splay Trees: Relations to the Union-Find Problem". Online Algorithms, DIMACS Series in Discrete Mathematics and Theoretical Computer Science Vol. 7, pages 95--124, 1991.] Let S be any permutation of the elements of the splay tree. Then the cost of deleting the elements in the order S is O(n).

ee also

* Donald Knuth. "The Art of Computer Programming", Volume 3: "Sorting and Searching", Third Edition. Addison-Wesley, 1997. ISBN 0-201-89685-0. Page 478 of section 6.2.3.
* finger tree
* Link/cut tree
* Scapegoat tree

References

External links

Algorithm

* [http://www.cs.cmu.edu/~sleator/papers/self-adjusting.pdf "Self-adjusting Binary Search Trees", Sleator and Tarjan (the original publication)]
* [http://www.nist.gov/dads/HTML/splaytree.html NIST's Dictionary of Algorithms and Data Structures: Splay Tree]

Implementations

* [http://www.link.cs.cmu.edu/link/ftp-site/splaying/ Implementations in C and Java by Sleator (one of the original inventors)]
* [http://fxr.watson.org/fxr/source//sys/tree.h FreeBSD's single header file implementation]

Visualizations

* [http://www.cs.nyu.edu/algvis/java/SplayTree.html New York University: Dept of Computer Science: Algorithm Visualization: Splay Trees]
* Pointers to [http://web-cat.cs.vt.edu/AlgovizWiki/SplayTrees splay tree visualizations]
* [http://www.ibr.cs.tu-bs.de/lehre/ss98/audii/applets/BST/SplayTree-Example.html Splay Tree Applet]
* [http://webpages.ull.es/users/jriera/Docencia/AVL/AVL%20tree%20applet.htm AVL, Splay and Red/Black Applet]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Splay — can also refer to::*An audio player intended for console use.:*A Japanese band.:*An SGC term made up by Scott Johnson:See also Splay tree.Splay is a physiological term that refers to the difference between urine threshold (the amount of a… …   Wikipedia

  • Splay-Baum — In der Informatik ist ein Splay Baum (auch Spreizbaum genannt, englisch Splay tree) eine baumartige Datenstruktur zum Verwalten verschiedener Elemente. Die Besonderheit von Splay Bäumen ist, dass sie erwartet balanciert sind, wodurch alle… …   Deutsch Wikipedia

  • Tree (data structure) — A simple unordered tree; in this diagram, the node labeled 7 has two children, labeled 2 and 6, and one parent, labeled 2. The root node, at the top, has no parent. In computer science, a tree is a widely used data structure that emulates a… …   Wikipedia

  • Tree rotation — A tree rotation is an operation on a binary search tree that changes the structure without interfering with the order of the elements. A tree rotation moves one node up in the tree and one node down. They are used to change the shape of the tree …   Wikipedia

  • Splay-дерево — Расширяющееся дерево (англ. splay tree) является двоичным деревом поиска, в котором поддерживается свойство сбалансированности. Это дерево принадлежит классу «саморегулирующихся деревьев», которые поддерживают необходимый баланс ветвления дерева …   Википедия

  • Binary search tree — In computer science, a binary search tree (BST) is a binary tree data structurewhich has the following properties: *each node (item in the tree) has a value; *a total order (linear order) is defined on these values; *the left subtree of a node… …   Wikipedia

  • AVL tree — In computer science, an AVL tree is a self balancing binary search tree, and it is the first such data structure to be invented. [Robert Sedgewick, Algorithms , Addison Wesley, 1983, ISBN 0 201 06672 6, page 199, chapter 15: Balanced Trees.] In… …   Wikipedia

  • Scapegoat tree — In computer science, a scapegoat tree is a self balancing binary search tree, invented by Igal Galperin and Ronald L. Rivest. It provides worst case O(log n ) lookup time, and O(log n ) amortized insertion and deletion time.Unlike other self… …   Wikipedia

  • Binary tree — Not to be confused with B tree. A simple binary tree of size 9 and height 3, with a root node whose value is 2. The above tree is unbalanced and not sorted. In computer science, a binary tree is a tree data structure in which each node has at… …   Wikipedia

  • Hash tree — A binary hash tree In cryptography and computer science Hash trees or Merkle trees are a type of data structure[citation needed] which contains a tree of summary information about a larger piece of da …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”