- Heap (data structure)
-
This article is about the programming data structure. For the dynamic memory area, see Dynamic memory allocation.
In computer science, a heap is a specialized tree-based data structure that satisfies the heap property: if B is a child node of A, then key(A) ≥ key(B). This implies that an element with the greatest key is always in the root node, and so such a heap is sometimes called a max-heap. (Alternatively, if the comparison is reversed, the smallest element is always in the root node, which results in a min-heap.) There is no restriction as to how many children each node has in a heap, although in practice each node has at most two. The heap is one maximally-efficient implementation of an abstract data type called a priority queue. Heaps are crucial in several efficient graph algorithms such as Dijkstra's algorithm, and in the sorting algorithm heapsort.
A heap data structure should not be confused with the heap which is a common name for dynamically allocated memory. The term was originally used only for the data structure. Some early popular languages such as LISP provided dynamic memory allocation using heap data structures, which gave the memory area its name[1].
Heaps are usually implemented in an array, and do not require pointers between elements.
The operations commonly performed with a heap are:
- create-heap: create an empty heap
- find-max or find-min: find the maximum item of a max-heap or a minimum item of a min-heap, respectively
- delete-max or delete-min: removing the root node of a max- or min-heap, respectively
- increase-key or decrease-key: updating a key within a max- or min-heap, respectively
- insert: adding a new key to the heap
- merge: joining two heaps to form a valid new heap containing all the elements of both.
Contents
Variants
- 2-3 heap
- Beap
- Binary heap
- Binomial heap
- Brodal queue
- D-ary heap
- Fibonacci heap
- Leftist heap
- Pairing heap
- Skew heap
- Soft heap
Comparison of theoretic bounds for variants
The following time complexities[1] are amortized (worst-time) time complexity for entries marked by an asterisk, and regular worst case time complexities for all other entries. O(f) gives asymptotic upper bound and Θ(f) is asymptotically tight bound (see Big O notation). Function names assume a min-heap.
Operation Binary[1] Binomial[1] Fibonacci[1] Pairing[2] Brodal[3] create-heap Θ(1) Θ(1) Θ(1) ? O(1) findMin Θ(1) O(log n) Θ(1) O(1)* O(1) deleteMin Θ(log n) Θ(log n) O(log n)* O(log n)* O(log n) insert Θ(log n) O(log n) Θ(1) O(1)* O(1) decreaseKey Θ(log n) Θ(log n) Θ(1)* O(log n)* O(1) merge Θ(n) O(log n)** Θ(1) O(1)* O(1) (*)Amortized time
(**)Where n is the size of the larger heapApplications
The heap data structure has many applications.
- Heapsort: One of the best sorting methods being in-place and with no quadratic worst-case scenarios.
- Selection algorithms: Finding the min, max, both the min and max, median, or even the k-th largest element can be done in linear time (often constant time) using heaps.[4]
- Graph algorithms: By using heaps as internal traversal data structures, run time will be reduced by polynomial order. Examples of such problems are Prim's minimal spanning tree algorithm and Dijkstra's shortest path problem.
Full and almost full binary heaps may be represented in a very space-efficient way using an array alone. The first (or last) element will contain the root. The next two elements of the array contain its children. The next four contain the four children of the two child nodes, etc. Thus the children of the node at position
n
would be at positions2n
and2n+1
in a one-based array, or2n+1
and2n+2
in a zero-based array. This allows moving up or down the tree by doing simple index computations. Balancing a heap is done by swapping elements which are out of order. As we can build a heap from an array without requiring extra memory (for the nodes, for example), heapsort can be used to sort an array in-place.One more advantage of heaps over trees in some applications is that construction of heaps can be done in linear time using Tarjan's algorithm.[clarification needed]
Implementations
- The C++ Standard Template Library provides the make_heap, push_heap and pop_heap algorithms for heaps (usually implemented as binary heaps), which operate on arbitrary random access iterators. It treats the iterators as a reference to an array, and uses the array-to-heap conversion. Container adaptor priority_queue also exists.
- The Java 2 platform (since version 1.5) provides the binary heap implementation with class java.util.PriorityQueue<E> in Java Collections Framework.
- Python has a heapq module that implements a priority queue using a binary heap.
- PHP has both maxheap (SplMaxHeap) and minheap (SplMinHeap) as of version 5.3 in the Standard PHP Library.
- Perl has implementations of binary, binomial, and Fibonacci heaps in the Heap distribution available on CPAN.
See also
- Stack (data structure)
- Sorting algorithm
- Treap, a form of binary search tree based on heap-ordered trees
References
- ^ a b c d e Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest (1990): Introduction to algorithms. MIT Press / McGraw-Hill.
- ^ Iacono, John (2000), "Improved upper bounds for pairing heaps", Proc. 7th Scandinavian Workshop on Algorithm Theory, Lecture Notes in Computer Science, 1851, Springer-Verlag, pp. 63–77, doi:10.1007/3-540-44985-X_5
- ^ http://www.cs.au.dk/~gerth/papers/soda96.pdf
- ^ Frederickson, Greg N. (1993), "An Optimal Algorithm for Selection in a Min-Heap", Information and Computation, 104, Academic Press, pp. 197–214, doi:10.1006/inco.1993.1030, http://ftp.cs.purdue.edu/research/technical_reports/1991/TR%2091-027.pdf
Trees in computer science Binary trees Self-balancing binary search trees B-trees Tries Binary space partitioning (BSP) trees Non-binary trees Exponential tree · Fusion tree · Interval tree · PQ tree · Range tree · SPQR tree · Van Emde Boas treeSpatial data partitioning trees Other trees Heap · Hash tree · Finger tree · Metric tree · Cover tree · BK-tree · Doubly-chained tree · iDistance · Link-cut tree · Fenwick treeData structures Types Abstract Arrays Linked Trees Graphs Categories:- Heaps (structure)
Wikimedia Foundation. 2010.