- Interval arithmetic
Interval arithmetic, also called "interval mathematics", "interval analysis", and "interval computation", is a method in
mathematics . It has been developed by mathematicians since the 1950s and 1960s as an approach to putting bounds onrounding error s in mathematical computation and thus developingnumerical methods that yield very reliable results.Where classical arithmetic defines operations on individual numbers, interval arithmetic defines a set of operations on intervals:
:T · S = { "x" | there is some "y" in "T", and some "z" in "S", such that "x" = "y" · "z" }.
The basic operations of interval arithmetic are, for two intervals ["a", "b"] and ["c", "d"] that are subsets of the real line (-∞,∞) ,
* ["a","b"] + ["c","d"] = ["a" + "c", "b" + "d"]
* ["a","b"] − ["c", "d"] = ["a" − "d", "b" −"c"]
* ["a","b"] × ["c","d"] = [min ("ac", "ad", "bc", "bd"), max ("ac", "ad", "bc", "bd")]
* ["a","b"] / ["c","d"] = [min ("a/c", "a/d", "b/c", "b/d"), max ("a/c", "a/d", "b/c", "b/d")]Division by an interval containing zero is not defined under the basic interval arithmetic. The addition and multiplication operations are
commutative ,associative and sub-distributive : the set "X" ( "Y" + "Z" ) is a subset of "XY" + "XZ".Instead of working with an uncertain real we work with the two ends of the interval which contains : lies between and , or could be one of them. Similarly a function when applied to is also uncertain. Instead, in interval arithmetic produces an interval which is all the possible values for for all .
This concept is suitable, inter alia, for the treatment of rounding errors directly during the calculation and of uncertainties in the knowledge of the exact values of physical and technical parameters. The latter often arise from measurement errors and tolerances for components. Interval arithmetic also helps find reliable and guaranteed solutions to equations and optimization problems.
Take as an example the calculation of
body mass index (BMI). The BMI is the body weight in kilograms divided by the square of height in metres. Measuring the mass with bathroom scales may have an accuracy of one kilogram. We will not know intermediate values - about 79.6 kg or 80.3 kg - but information rounded to the nearest whole number. It is unlikely that you really weigh 80.0 kg exactly when it appears. In normal rounding to the nearest value, the scales showing 80 kg indicates a weight between 79.5 kg and 80.5 kg. The relevant range is that of all real numbers that are greater than or equal to 79.5, while less than or equal to 80.5, or in other words the interval [79.5,80.5] .For a man who weighs 80 kg and is 1.80 m tall, the BMI is about 24.7. With a weight of 79.5 kg and the same height the value is 24.5, while 80.5 kilograms gives almost 24.9. So the actual BMI is in the range [24.5,24.9] . The error in this case does not affect the conclusion (normal weight), but this is not always the position. For example, weight fluctuates in the course of a day so that the BMI can vary between 24 (normal weight) and 25 (overweight). Without detailed analysis it is no possible to always exclude questions as to whether an error ultimately is large enough to have significant influence.
Interval arithmetic states the range of possible outcomes explicitly. Simply put, results are no longer stated as numbers, but as intervals which represent imprecise values. The size of the intervals are similar to error bars to a metric in expressing the extent of uncertainty. Simple arithmetic operations, such as basic arithmetic and trigonometric functions, enable the calculation of outer limits of intervals.
Introduction
The main focus in the interval arithmetic is on the simplest way to calculate upper and lower endpoints for the the range of values of a function in one or more variables. These barriers need be not necessarily the
supremum orinfimum , since the precise calculation of those values are often too difficult; it can be shown that that task is in generalNP-hard .Treatment is typically limited to real intervals, so quantities of form:,where and are allowed; with one of then infinite we would have an unbounded interval, while with both infinite we would have the whole real number line.
As with traditional calculations with real numbers, simple arithmetic operations and functions on elementary intervals must first be defined (Lit.: Kulisch, 1989). More complicated functions can be calculated from these basic elements (Lit.: Kulish, 1989).
imple arithmetic
Returning to the earlier BMI example, in determining the body mass index, height and body weight both affect the result. For height, measurements are usually in round centimetres: a recorded measurement of 1.80 metres actually means a height somewhere between 1.795 m and 1.805 m. This uncertainty must be combined with the fluctuation range in weight between 79.5 kg and 80.5 kg. The BMI is defined as the weight in kilograms divided by the square of height in metre. Using either 79.5 kg and 1.795 m or 80.5 kg and 1.805 m gives approximately 24.7. But the person in question may only be 1.795 m tall, with a weight of 80.5 kilograms - or 1.805 m and 79.5 kilograms: all combinations of all possible intermediate values must be considered. Using the interval arithmetic methods described below, the BMI lies in the interval:
An operation on two intervals , with for example being addition or multiplication, is defined by
:.For the four basic arithmetic operations this can become:
provided that is allowed for all and .
For practical applications this can be simplified further:
*
Addition :
*Subtraction :
*Multiplication :
* Division: , where if . For division by an interval including zero, first define: and . For , we get which as a single interval gives ; this loses useful information about . So typically it is common to work with and as separate intervals.Because several such divisions may occur in an interval arithmetic calculation, it is sometimes useful to do the calculation with so-called "multi-intervals" of the form . The corresponding "multi-interval arithmetic" maintains a disjoint set of intervals and also provides for overlapping intervals to unite (Lit.: Dreyer, 2005).
Since a number can be interpreted as the interval , you can combine intervals and real numbers.
With the help of these definitions, it is already possible to calculate the range of simple functions, such as . If, for example, and , it is clear
:.
Interpreting this as a function of the variable with interval parameters and , them it is possible to find the roots of this function. It is then
:,the possible zeros are in the interval .
As in the above example, the multiplication of intervals often only requires two multiplications. It is in fact
:, if .
The multiplication can be see as a destination area of a rectangle with varying edges. The result interval covers all levels from the smallest to the largest.
The same applies when one of the two intervals is non-positive and the other non-negative. Generally, multiplication can produce results as wide as , for example if is squared. This also occurs, for example, in a division, if the numerator and denominator both contain zero.
Notation
To make the notation of intervals smaller in formulae, brackets can be used.
So we can use to represent an interval. For the set of all finite intervals, we can use :as an abbreviation. For a vector of intervals we can also used a bold font: .
In such a compact notation, you should note that should not be confused between a so-called improper or single point interval and the lower and upper limit.
Elementary functions
Interval methods can also apply to functions which do not just use simple arithmetic, and we must also use other basic functions for redefining intervals, using already known monotonicity properties.
For
monotonic function s in one variable, the range of values is also easy. If is monotonically rising or falling in the interval , then for all values in the interval such that , one of the following inequalities applies::, or .The range corresponding to the interval can be calculated by applying the function to the endpoints and ::.
From this the following basic features for interval functions can easily be defined:
*Exponential function : , for ,
*Logarithm : , for positive intervals and
* Odd powers: , for odd .For even powers, the range of values being considered is important, and needs to be dealt with before doing any multiplication.For example for should produce the interval when . But if you take by applying interval multiplication of form then the result will appear to be , wider than necessary.
Instead consider the function as a monotonically decreasing function for and a monotonically increasing function for . So for even :
* , if ,
* , if ,
* , otherwise.More generally, one can say that for piecewise monotonic functions it is sufficient to consider the endpoints of the interval , together with the so-called "critical points" within the interval being those points where the monotonicity of the function changes direction.
For the
sine andcosine functions, the critical points are at or for all respectively. Only up to five points matter as the resulting interval will be if at least half a period is in the input interval. For sine and cosine, only the endpoints need full evaluation as the critical points lead to easily pre-calculated values – namely -1, 0 , +1.Interval extensions of general functions
In general, it may not be easy to find such a simple description of the output interval for many functions. But it may still be possible to extend functions to interval arithmetic. If is a function from a real vector to a real number, then is called an "interval extension" of if:.
This definition of the interval extension does not give a precise result. For example, both and are allowable extensions of the exponential function. Extensions as tight as possible are desirable, taking into the relative costs of calculation and imprecision; in this case should be chosen as it give the tightest possible result.
The "natural interval extension" is achieved by combining the function rule with the equivalents of the basic arithmetic and elementary functions.
The "Taylor interval extension" (of degree ) is a times differentiable function defined by
:,for some , where is the th order differential of at the point and is an interval extension of the "Taylor remainder"
: The vector lies between and with , is protected by .Usually you choose to be the midpoint of the interval and use the natural interval extension to assess the remainder.
The special case of the Taylor interval extension of degree is also referred to as the "average interval extension".For an interval extension of the
Jacobian you get:.
A nonlinear function can be defined by linear features.
Interval methods
The methods of classical numerical analysis can not be transferred one-to-one into interval-valued algorithms, as dependencies between numerical values are usually not taken into account.
Rounded interval arithmetic
To working effectively in a real-life implementation, intervals must be compatible to floating point computing. The earlier operations were based on exact arithmetic, but in general fast numerical solution methods may not be available. The range of values of the function for and are for example . Where the same calculation is done with single digit precision, the result would normally be . But ,so this approach would contradict the basic principles of interval arithmetic contradict, as a part of the domain of would be lost.Instead, it is the outward rounded solution which is used.
The standard
IEEE 754 for binary floating-point arithmetic also sets out procedures for the implementation of rounding. An IEEE 754 compliant system allows programmers to round to the nearest floating point number; alternatives are rounding towards 0 (truncating), rounding toward positive infinity (i.e. up), or rounding towards negative infinity (i.e. down).The required "external rounding" for interval arithmetic can thus be achieved by changing the rounding settings of the processor in the calculation of the upper limit (up) and lower limit (down). Alternatively, an appropriate small interval can be added.
Dependency problem
The so-called "dependency problem" is a major obstacle to the application of interval arithmetic. Although interval methods can determine the range of elementary arithmetic operations and functions very accurately, this is not always true with more complicated functions. If an interval occurs several times in a calculation using parameters, and each occurrence is taken independently then this can lead to an unwanted expansion of the resulting intervals.
As an illustration, take the function defined by . The values of this function over the interval are really . As the natural interval extension, it is calculated as , which is slightly larger; we have instead calculated the infimum and supremum of the function over . There is a better expression of in which the variable only appears once, namely by rewriting as addition and squaring in the quadratic . So the suitable interval calculation is :and gives the correct values.
In general, it can be shown that the exact range of values can be achieved, if each variable appears only once. However, not every function can be rewritten this way.
The dependency of the problem causing over-estimation of the value range can go as far as covering a large range, preventing more meaningful conclusions.
An additional increase in the range stems from the solution of areas that do not take the form of an interval vector. The solution set of the linear system: for is precisely the line between the points and . Interval methods deliver the best case, but in the square , The real solution is contained in this square (this is known as the "wrapping effect").
Linear interval systems
A linear interval system consists of a matrix interval extension and an interval vector . We want the smallest cuboid , for all vectors which there is a pair with and satisfying:.
For quadratic systems - in other words, for - there can be such an interval vector , which covers all possible solutions, found simply with the interval Gauss method. This replaces the numerical operations, in that the linear algebra method known as Gaussian elimination becomes its interval version. However, since this method uses the interval entities and repeatedly in the calculation, it can produce poor results for some problems Hence using the result of the interval-valued Gauss only provides first rough estimates, since although it contains the entire solution set, it also has a large area outside it.
A rough solution can often be improved by an interval version of the
Gauss–Seidel method . The motivation for this is that the -th row of the interval extension of the linear equation:can be determined by the variable if the division is allowed. It is therefore simultaneously : and .So you can now replace by :, and so the vector by each element.Since the procedure is more efficient for adiagonally dominant matrix , instead of the system you can often try multiplying it by an appropriate rational matrix with the resulting matrix equation:left to solve. If you choose, for example, for the central matrix , then is outer extension of the identity matrix.These methods only work well if the widths of the intervals occurring are sufficiently small. For wider intervals it can be useful to use an interval-linear system on finite (albeit large) real number equivalent linear systems. If all the matrices are invertible, it is sufficient to consider all possible combinations (upper and lower) of the endpoints occurring in the intervals. The resulting problems can be resolved using conventional numerical methods. Interval arithmetic is still used to determine rounding errors.
This is only suitable for systems of smaller dimension, since with a fully occupied matrix, real matrices need to be inverted, with vectors for the right hand side. This approach was developed by Jiri Rohn and is still being developed. [ [http://www.cs.cas.cz/rohn/publist/000home.htm Jiri Rohn, List of publications] ]
Interval Newton method
An interval variant of
Newton's method for finding the zeros in an interval vector can be derived from the average value extension (Lit.: Hansen, 1992). For an unknown vector applied to , gives:.For a zero , that is , and thus must satisfy:.This is equivalent to .An outer estimate of can be determined using linear methods.In each step of the interval Newton method, an approximate starting value is replaced by and so the result can be improved iteratively. In contrast to traditional methods, the interval method approaches the result by containing the zeros. This guarantees that the result will produce all the zeros in the initial range. Conversely, it will prove that no zeros of were in the initial range if a Newton step produces the empty set.
The method converges on all zeros in the starting region. Division by zero can lead to separation of distinct zeros, though the separation may not be complete; it can be complemented by the bisection method.
As an example, consider the function , the starting range , and the point . You then have and the first Newton step gives :.There is therefore a zero in .More Newton steps are used separately on and . These converge to arbitrarily small intervals around and .
The Interval Newton method can also be used with "thick functions" such as , which would in any case have interval results. The result then produces intervals containing .
Bisection and covers
The various interval methods deliver conservative results as dependencies between the sizes of different intervals extensions are not taken into account. However the dependency problem becomes less significant for narrower intervals.
Covering an interval vector by smaller boxes so that is then valid for the range of values So for the interval extensions described above, is valid.Since is often a genuine
superset of the right-hand side, this usually leads to an improved estimate.Such a cover can be generated by the
bisection method such as thick elements of the interval vector by splitting in the centre into the two intervals and . It the result is still not suitable then further gradual subdivision is possible. Note that a cover of intervals results from divisions of vector elements, substantially increasing the computation costs.With very wide intervals, it can be helpful to split all intervals into several subintervals with a constant (and smaller) width, a method known as "mincing". This then avoids the calculations for intermediate bisection steps. Both methods are only suitable for problems of low dimension.
Application
Interval arithmetic can be use in various areas, in order to be treated estimates for which no exact numerical values can stated (Lit.: Jaulin et al., 2001).
Rounding error analysis
Interval arithmetic is used with error analysis, to control rounding errors arising from each calculation. The advantage of interval arithmetic is that after each operation there is an interval which reliably includes the true result. The distance between the interval boundaries gives the current calculation of rounding errors directly:: Error = for a given interval .Interval analysis adds to rather than substituting for traditional methods for error reduction, such as pivoting.
Tolerance analysis
Parameters for which no exact figures can be allocated often arise during the simulation of technical and physical processes. The production process of technical components allows certain tolerances, so some parameters fluctuate within intervals.In addition, many fundamental constants not are not known precisely (Lit.: Dreyer, 2005).
If the behavior of such a system affected by tolerances satisfies, for example, , for and unknown then the set of possible solutions :,can be found by interval methods. This provides an alternative to traditional
propagation of error analysis. Unlike point methods, such asMonte Carlo simulation , interval arithmetic methodology ensures that no part of the solution area can be overlooked.However, the result is always a worst case analysis for the distribution of error, as other probability-based distributions are not considered.Fuzzy interval arithmetic
Interval arithmetic can also be used with affiliation functions for fuzzy quantities as they are used in
fuzzy logic . Apart from the strict statements and , intermediate values are also possible, to which real numbers are assigned. corresponds to definite membership while is non-membership. A distribution function assigns uncertainty which can be understood as a further interval.For "fuzzy arithmetic" [ [http://www.iam.uni-stuttgart.de/Mitarbeiter/Hanss/hanss_en.htm Application of Fuzzy Arithmetic to Quantifying the Effects of Uncertain Model Parameters, Michael Hanss] ,
University of Stuttgart ] only a finite number of discrete affiliation stages are considered. The form of such a distribution for an indistinct value can then represented by a sequence of intervals:. The interval corresponds exactly to the fluctuation range for the stage .The appropriate distribution for a function concerning indistinct values and the corresponding sequences can be approximated by the sequence .The values are given by and can be calculated by interval methods. The value corresponds to the result of an interval calculation.
History
Interval arithmetic is not a completely new phenomenon in mathematics; it has appeared several times under different names in the course of history. For example Archimedes calculated lower and upper bounds 223/71 < π < 22/7 in the 3rd century BC.Actual calculations with intervals has neither been as popular as other numerical techniques, nor been completely forgotten.
Rules for calculating with intervals and other subsets of the real numbers were published in a 1931 work by Rosalind Cicely Young, a doctoral candidate at the University of Cambridge. Arithmetic work on range numbers to improve reliability of digital systems were then published in a 1951 textbook on linear algebra by Paul Dwyer (University of Michigan); intervals were used to measure rounding errors associated with floating-point numbers.
The birth of modern interval arithmetic was marked by the appearance of the book "Interval Analysis" by Ramon E. Moore in 1966 (Lit.: Moore). He had the idea in Spring 1958, and a year later he published an article about computer interval arithmetic. [ [http://interval.louisiana.edu/Moores_early_papers/bibliography.html Publications Related to Early Interval Work of R. E. Moore] ] . Its merit was that stating with a simple principle it provided is a general method for automated error analysis, not just errors resulting from rounding.
Independently in 1956, Mieczyslaw Warmus suggested formulae for calculations with intervals [ [http://www.ippt.gov.pl/~zkulpa/quaphys/warmus.html Precursory papers on interval analysis by M. Warmus] ] , though Moore found the first non-trivial applications.
In the following twenty years German groups of researchers carried out pioneering work around Götz Alefeld (Lit.: Alefeld and Herzberger) and Ulrich Kulisch (Lit.: Kulisch) at the University of Karlsruhe and later also at the Bergische University of Wuppertal. For example, Karl Nickel explored more effective implementations, while improved containment procedures for the solution set of systems of equations were due to Arnold Neumaier among others. [ [http://www.mat.univie.ac.at/~neum/publist.html Publications by Arnold Neumaier] ] . In the 1960s Eldon R. Hansen dealt with interval extensions for linear equations and then provided crucial contributions to global optimisation (Lit.: Hansen). Classical methods in this often are have the problem of determining the largest (or smallest) global value, but could only find a local optimum and could not find better values; Helmut Ratschek and Jon George Rokne developed
branch and bound methods, which till then had only applies to integer values, by using intervals to provide applications for continuous values [ [http://pages.cpsc.ucalgary.ca/~rokne/#SEC3 Some publications of Jon Rokne] ] .In 1988 Rudolf Lohner developed Fortran-based software for reliable solutions for initial value problems using
ordinary differential equations . [ [http://fam-pape.de/raw/ralph/studium/dgl/dglsem.html Bounds for ordinary differential equations of Rudolf Lohner] (in German)]The journal "Reliable Computing" (originally "Interval Computations") has been published since the 1990s , dedicated to the reliability of computer-aided computations. As lead editor, R. Baker Kearfott, in addition to his work on global optimisation, has contributed significantly to the unification of notation and terminology used in interval arithmetic (Web: Kearfott).
In recent years work has concentrated in particular on the estimation of
preimage s of parameterised functions and to robust control theory by the COPRIN working group ofINRIA inSophia Antipolis in France (Web: INRIA).Patents
One of the main sponsors of the interval arithmetic, G. William Walster of Sun Microsystems, has - in part with Ramon E. Moore and Eldon R. Hansen - lodged several patents in the field of interval arithmetic atthe U.S. Patent and Trademark Office in the years 2002-04 [ [http://www.mat.univie.ac.at/coconut-environment/#patents Patent Issues in Interval Arithmetic] ] . The validity of these patent applications have been disputed in the interval arithmetic research community, since they may possibly only show the past state of the art.
Implementations
There are many software packages which permit the development of numerical applications using interval arithmetic [ [http://www.cs.utep.edu/interval-comp/main.html Software for Interval Computations collected by Vladik Kreinovich] ,
University of Texas at El Paso ] .These are usually provided in the form of program libraries. [ [http://docs.sun.com/source/816-2465/iapgCusing.html#26326 C++ Interval Arithmetic Programming Reference] fromSun Microsystems ] There are also C++ and Fortran compilers handle interval data types and suitable operations as a language extension, [ [http://developers.sun.com/sunstudio/overview/topics/numerics_index.html C++ and Fortran compilers with Interval data types] fromSun Microsystems ] so interval arithmetic is supported directly.Since 1967 "Extensions for Scientific Computation" (XSC) have been developed in the University of Karlsruhe for various programming languages, such as C++, Fortran and Pascal. [ [http://www.math.uni-wuppertal.de/org/WRST/xsc/history.html History of XSC-Languages] ] The first platform was a Zuse Z 23, for which a new interval data type with appropriate elementary operators was made available. There followed in 1976 Pascal-SC, a Pascal variant on a Zilog Z80 which it made possible to create fast complicated routines for automated result verification. Then came the Fortran 77-based ACRITH XSC for the System/370 architecture, which was later delivered by IBM. Starting from 1991 one could produce code for C compilers with Pascal XSC; a year later the C++ class library supported C-XSC on many different computer systems. In 1997 all XSC variants were made available under the General Public License. At the beginning of 2000 C-XSC 2.0 was released under the leadership of the working group for scientific computation at the Bergische University of Wuppertal, in order to correspond to the improved C++ standard.
Another C++- class library was created in 1993 at the Hamburg University of Technology called "Profil/BIAS" (Programmer's Runtime Optimized Fast Interval Library, Basic Interval Arithmetic), which made the usual interval operations more user friendly. It emphasised the efficient use of hardware, portability and independence of a particular presentation of intervals.
The Boost collection of C++ libraries contains a template class for intervals. Its authors are aiming to have interval arithmetic in the standard C++ language. [ [http://www-sop.inria.fr/geometrica/team/Sylvain.Pion/cxx/ A Proposal to add Interval Arithmetic to the C++ Standard Library] ]
In addition computer algebra systems, such as Mathematica, Maple and MuPAD, can handle intervals. There is a Matlab extension "Intlab" which builds on BLAS routines, as well as the Toolbox b4m which makes a Profil/BIAS interface. [ [http://www.ti3.tu-harburg.de/~rump/intlab/ INTerval LABoratory] and [http://www.ti3.tu-harburg.de/zemke/b4m/ b4m] ] .
See also
*
Automatic differentiation
*Multigrid Method
*Monte-Carlo simulation Further information
Literature
* Götz Alefeld und Jürgen Herzberger: "Einführung in die Intervallrechnung". Bibliographisches Institut, Reihe Informatik, Band 12, B.I.-Wissenschaftsverlag, Mannheim - Wien - Zürich, ISBN 3-411-01466-0
* Alexander Dreyer: "Interval Analysis of Analog Circuits with Component Tolerances". Doctoral thesis, Shaker Verlag, Aachen, 2003, ISBN 3-8322-4555-3.
* Eldon Hansen and G. William Walster: "Global Optimization using Interval Analysis, Second Edition, Revised and Expanded", Marcel Dekker, New York, 2004, ISBN 0-8247-4059-9.
* L. Jaulin, M. Kieffer, O. Didrit, and É.Walter: "Applied Interval Analysis: With examples in parameter estimation robust control and robotics". Springer, London, 2001, ISBN 1-85233-219-0.
* Ulrich Kulisch: "Wissenschaftliches Rechnen mit Ergebnisverifikation. Eine Einführung", Vieweg-Verlag, Wiesbaden 1989, ISBN 3-528-08943-1.
* R. E. Moore: "Interval Analysis". Prentice-Hall, Englewood Cliff, NJ, 1966, ISBN 0-13-476853-1.External links
* [http://www.cs.utep.edu/interval-comp/hayes.pdf Brian Hayes, 'A Lucid Interval', a good introduction (pdf)]
* [http://www-sop.inria.fr/coprin/logiciels/ALIAS/Movie/movie_undergraduate.mpg Introductory Film (mpeg)] of the [http://www-sop.inria.fr/coprin/index_english.html COPRIN] teams ofINRIA ,Sophia Antipolis
* [http://interval.louisiana.edu/kearfott.html Bibliography of R. Baker Kearfott] ,University of Louisiana at Lafayette
* [http://www.mat.univie.ac.at/~neum/interval.html Interval Methods from Arnold Neumaier] ,University of Vienna References
Wikimedia Foundation. 2010.