- Shell sort
Shell sort is a
sorting algorithm that is a generalization ofinsertion sort , with two observations:
*insertion sort is efficient if the input is "almost sorted", and
*insertion sort is typically inefficient because it moves values just one position at a time.History
The Shell sort is named after its inventor,
Donald Shell , who published the algorithm in 1959. [cite journal |last=Shell |first=D.L. |title=A high-speed sorting procedure |journal=Communications of the ACM |volume=2 |issue=7 |year=1959 |pages=30–32 |doi=10.1145/368370.368387] Some older textbooks and references call this the "Shell-Metzner" sort afterMarlene Metzner Norton , but according to Metzner, "I had nothing to do with the sort, and my name should never have been attached to it." [cite web |title=Shell sort |publisher=National Institute of Standards and Technology |url=http://www.nist.gov/dads/HTML/shellsort.html |accessdate=2007-07-17 ]Implementation
The original implementation performs O("n"2) comparisons and exchanges in the worst case. A minor change given in V. Pratt's book [Cite book|last=Pratt|first=V|year=1979|publisher=Garland|title=Shellsort and sorting networks (Outstanding dissertations in the computer sciences)|id=ISBN 0-824-04406-1 (This was originally presented as the author's Ph.D. thesis, Stanford University, 1971)] improved the bound to O("n" log2 "n"). This is worse than the optimal
comparison sort s, which are O("n" log "n").Shell sort improves insertion sort by comparing elements separated by a gap of several positions. This lets an element take "bigger steps" toward its expected position. Multiple passes over the data are taken with smaller and smaller gap sizes. The last step of Shell sort is a plain insertion sort, but by then, the array of data is guaranteed to be almost sorted.
Consider a small value that is initially stored in the wrong end of the
array . Using an O("n"2) sort such asbubble sort orinsertion sort , it will take roughly "n" comparisons and exchanges to move this value all the way to the other end of the array. Shell sort first moves values using giant step sizes, so a small value will move a long way towards its final position, with just a few comparisons and exchanges.One can visualize Shellsort in the following way: arrange the list into a table and sort the columns (using an
insertion sort ). Repeat this process, each time with smaller number of longer columns. At the end, the table has only one column. While transforming the list into a table makes it easier to visualize, the algorithm itself does its sorting in-place (by incrementing the index by the step size, i.e. usingi += step_size
instead ofi++
).For example, consider a list of numbers like
[ 13 14 94 33 82 25 59 94 65 23 45 27 73 25 39 10 ]
. If we started with a step-size of 5, we could visualize this as breaking the list of numbers into a table with 5 columns. This would look like this:13 14 94 33 8225 59 94 65 2345 27 73 25 3910We then sort each column, which gives us
10 14 73 25 2313 27 94 33 3925 59 94 65 8245When read back as a single list of numbers, we get
[ 10 14 73 25 23 13 27 94 33 39 25 59 94 65 82 45 ]
. Here, the 10 which was all the way at the end, has moved all the way to the beginning. This list is then again sorted using a 3-gap sort, and then 1-gap sort (simple insertion sort).Gap sequence
The "gap sequence" is an integral part of the shellsort algorithm. Any increment sequence will work, as long as the last element is 1. The algorithm begins by performing a "gap insertion sort", with the gap being the first number in the gap sequence. It continues to perform a gap insertion sort for each number in the sequence, until it finishes with a gap of 1. When the gap is 1, the gap insertion sort is simply an ordinary
insertion sort , guaranteeing that the final list is sorted.The gap sequence that was originally suggested by
Donald Shell was to begin with and to halve the number until it reaches 1. While this sequence provides significant performance enhancements over the quadratic algorithms such asinsertion sort , it can be changed slightly to further decrease the average and worst-case running times. Weiss' textbook [Cite book|last=Weiss|first=Mark Allen|year=2002|publisher=Addison Wesley|title=Data Structures & Problem Solving using Java|id=ISBN 0-201-74835-5] demonstrates that this sequence allows a worst case sort, if the data is initially in the array as (small_1, large_1, small_2, large_2, ...) - that is, the upper half of the numbers are placed, in sorted order, in the even index locations and the lower end of the numbers are placed similarly in the odd indexed locations.Perhaps the most crucial property of Shellsort is that the elements remain k-sorted even as the gap diminishes. For instance, if a list was 5-sorted and then 3-sorted, the list is now not only 3-sorted, but both 5- and 3-sorted. If this were not true, the algorithm would undo work that it had done in previous iterations, and would not achieve such a low running time.
Depending on the choice of gap sequence, Shellsort has a proven worst-case running time of (using Shell's increments that start with 1/2 the array size and divide by 2 each time), (using Hibbard's increments of ), (using Sedgewick's increments of , or ), or (using Pratt's increments ), and possibly unproven better running times. The existence of an worst-case implementation of Shellsort was precluded by Poonen, Plaxton, and Suel [Cite journal|last=Poonen, Plaxton, Suel|title=Improved Lower Bounds for Shellsort|journal=Annual Symposium on Foundations of Computer Science|year=1992|issue=33|pages=226–235] .
The best known sequence is 1, 4, 10, 23, 57, 132, 301, 701, 1750. Such a Shell sort is faster than an
insertion sort and aheap sort ,vague|date=March 2008 but if it is faster than aquicksort for small arrays (less than 50 elements), it is slower for bigger arrays.Fact|date=March 2008 After 1750, gaps ingeometric progression can be used, such as: nextgShell sort algorithm in C/C++
The following is an implementation of shell sort written in C/
C++ for sorting anarray of integers. The increment sequence used in this example code gives an O(n2) worst-case running time.hell sort algorithm in Java
A Java implementation of Shell sort is as follows:
hell sort algorithm in Python
The function doesn't return the array to denote it mutates the input. That script prints:
References
External links
* [http://www.iti.fh-flensburg.de/lang/algorithmen/sortieren/shell/shellen.htm Detailed analysis of Shell sort]
* [http://www.cs.princeton.edu/~rs/shell/ Analysis of Shellsort and Related Algorithms, Robert Sedgewick]
* [http://www.research.att.com/~njas/sequences/A102549 Best known gap sequence]
* [http://vision.bc.edu/~dmartin/teaching/sorting/anim-html/shell.html A graphical demonstration and discussion of shell sort]
Wikimedia Foundation. 2010.