Icon (programming language)

Icon (programming language)

Infobox programming language
name = Icon

paradigm = multi-paradigm: structured, text-oriented
year = 1977
designer = Ralph Griswold
developer =
latest_release_version = 9.4.3
latest_release_date = November 14, 2005
typing = dynamic, weak
implementations = Icon
dialects = Unicon
influenced_by = SNOBOL, ALGOL
influenced = Lua, Python
website = [http://www.cs.arizona.edu/icon/ www.cs.arizona.edu/icon]

Icon is a very high-level programming language featuring "goal directed execution" and many facilities for managing strings and textual patterns. It is related to SNOBOL, a string processing language. Icon is not object-oriented, but an object-oriented extension called Idol was developed in 1996 which eventually became Unicon.

Basic syntax

The Icon language is derived from the ALGOL-class of structured programming languages, and thus has syntax similar to C or Pascal. Icon is most similar to Pascal, using := syntax for assignments, the procedure keyword and similar syntax. On the other hand, Icon uses C-style brackets for structuring execution groups, and programs start by running a procedure called "main".

In many ways Icon also shares features with most scripting programming languages (as well as Snobol from which they were taken): variables do not have to be declared, types are cast automatically, and numbers can be converted to strings and back automatically. Another feature common to many scripting languages, but not all, is the lack of a line-ending character; in Icon, lines not ended by a semicolon get ended by an implied semicolon if it makes sense.

Procedures are the basic building blocks of Icon programs, and although they use Pascal naming they work more like C functions and can return values; there is no function keyword in Icon.

procedure doSomething(aString) write(aString) end

Goal-directed execution

One of Icon's key concepts is that control structures are based on the "success" or "failure" of expressions, rather than on boolean logic, as in most other programming languages. Under this model, simple comparisons like if a < b do not mean "if the operations to the right evaluate to true" as they would under most languages; instead it means something more like "if the operations to the right "succeed". In this case the < operator succeeds if the comparison is true, so the end result is the same. In addition, the < operator returns its second argument if it succeeds, allowing things like if a < b < c, a common type of comparison that cannot be directly stated in most languages.

The utility of this concept becomes much clearer when you consider real-world examples. Since Icon uses success or failure for all flow control, this simple code:

if a := read() then write(a)

Will copy one line of the standard input to standard output. What's interesting about this example is that the code will work even if the read() causes an error, for instance, if the file does not exist. In that case the statement a := read() will fail, and write will simply not be called.

Success and failure are passed "up" through functions, meaning that a failure inside a nested function will cause the functions calling it to fail as well. For instance, we can write a program to copy an entire input file to output in a single line:

while write(read())

When the read() command fails, at the end of file for instance, the failure will be passed up the chain and write() will fail as well. The while, being a control structure, stops on failure, meaning it stops when the file is empty. For comparison, consider a similar example written in Java-based pseudocode:

try { while ((a = read()) != EOF) { write(a); } } catch (Exception e) { // do nothing, exit the loop }

In this case there are two comparisons needed, one for end of file (EOF) and another for all other errors. Since Java does not allow errors to be compared as logic elements, as under Icon, the lengthy try/catch syntax must be used instead. Try blocks also impose a performance penalty for simply using them, even if no error occurs, a distributed cost that Icon avoids.

Icon refers to this concept as "goal-directed execution", referring to the way that execution continues until some goal is reached. In the example above the goal is to read the entire file; the read command continues to succeed while there is more information to be read, and fails when there isn't. The goal is thus coded directly in the language, instead of using statements checking return codes or similar constructs.

Generators

Expressions in Icon often return a single value, for instance, x < 5 will evaluate and succeed with the value 5 or fail. However several of the examples below rely on the fact that many expressions do not "immediately" return success or failure, returning values in the meantime. This drives the examples with every and to; every causes to to continue to return values until it fails.

This is a key concept in Icon, known as "generators". Generators drive much of the loop functionality in the language, but do so more directly; the programmer does not write a loop and then pull out and compare values, Icon will do all of this for you.

Icon includes several generator-builders. The "alternator" syntax allows a series of items to be generated in sequence until one fails: 1 | "hello" | x < 5 can generate "1", "hello", and "5" if x is less than 5. Alternators can be read as "or" in many cases, for instance:

if y < (x | 5) then write("y=", y)

will write out the value of y if it is smaller than x "or" 5. Internally Icon checks every value from left to right until one succeeds or the list empties and it returns a failure. Remember that functions will not be called unless the calls within do not fail, so this example can be shortened to:

write("y=", (x | 5) > y)

Another simple generator is the to, which generates lists of integers; every write(1 to 10) will do exactly what it seems to. The "bang syntax" generates every item of a list; every write(!aString) will output each character of aString on a new line.

To demonstrate the power of this concept, consider string operations. Most languages include a function known as find or indexOf that returns the location of a string within another. Consider:

s = "All the world's a stage. And all the men and women merely players"; i = indexOf("the", s)

This code will return 4, the position of the first occurrence of the word "the". To get the next instance of "the" an alternate form must be used, i = indexOf("the", s, 5), the 5 at the end saying it should look from position 5 on. In order to extract all the occurrences of "the", a loop must be used...

s = "All the world's a stage. And all the men and women merely players"; i = indexOf("the", s) while i != -1 { write(i); i = indexOf("the", s, i+1); }

Under Icon the find function is a generator, and will return the next instance of the string each time it is resumed before finally failing after it passes the end of the string. The same code under Icon can be written:

s := "All the world's a stage. And all the men and women merely players" every write(find("the",s))

find will return the index of the next instance of "the" each time it is resumed by every, eventually passing the end of the string and failing. As in the prior example, this will cause write to fail, and the (one-line) every loop to exit.

Of course there are times where you deliberately want to find a string after some point in input, for instance, you might be scanning a text file containing data in multiple columns. Goal-directed execution works here as well, and can be used this way:

write(5 < find("the", s))

The position will only be returned if "the" appears after position 5, the comparison will fail otherwise, passing that failure to write() as before. There is one small "trick" to this code that needs to be considered: comparisons return the right hand result, so it is important to put the find on the right hand side of the comparison. If the 5 were placed on the right, 5 would be written.

Icon adds several control structures for looping through generators. The every operator is similar to while, looping through every item returned by a generator and exiting on failure:

every k := i to j do write(someFunction(k))

Why use every instead of a while loop in this case? Because while re-evaluates the first result,but every produces all results.The every syntax actually injects values into the function in a fashion similar to blocks under Smalltalk. For instance, the above loop can be re-written this way:

every write(someFunction(i to j))

Users can build new generators easily using the suspend keyword:

procedure findOnlyOdd(pattern, theString) every i := find(pattern, theString) do if i % 2 = 1 then suspend i end

This example loops over "theString" using find to look for "pattern". When one is found, and the position is even, the location is returned from the function with suspend. Unlike return, suspend writes down where it is in the internal generators as well, allowing it to pick up where it left off on the next iteration.

trings

In keeping with its script-like functionality, Icon adds a number of features to make working with strings easier. Most notable among these is the "scanning" system, which repeatedly calls functions on a string:

s ? write(find("the"))

is a short form of the examples shown earlier. In this case the "subject" of the find function is placed outside the parameters in front of the question-mark. Icon functions are deliberately (as opposed to automatically) written to identify the subject in parameter lists and allow them to be pulled out in this fashion.

Other structures

Icon strings are simply lists of characters, similar to their partners in C. Icon also allows the user to easily construct their own lists (or "arrays"):

aCat := ["muffins", "tabby", 2002, 8]

The items within a list can be of any sort, including other structures. To quickly build larger lists, Icon includes the list generator; i := list(10, "word") generates a list containing 10 copies of "word".

Like arrays in other languages, Icon allows items to be looked up by position; weight := aCat [4] . Also remember the bang-syntax, every write(!aCat) will print out four lines, each with one element. Icon includes stack-like functions, push and pop to allow them to form the basis of stacks and queues.

Icon also includes functionality for sets and tables (known as "hashes", "associative arrays", "dictionaries", etc.):

symbols := table(0) symbols ["there"] := 1 symbols ["here"] := 2

This code creates a table that will use zero as the default value of any unknown key. It then adds two items into it, with the keys "there" and "here", and values 1 and 2.

References

The definitive work is "The Icon Programming Language" (third edition) by Griswold and Griswold, ISBN 1-57398-001-3.It is out of print but can be [http://www.cs.arizona.edu/icon/books.htm downloaded] in PDF form.

Icon also has co-expressions, providing non-local exits for program execution. Please see "The Icon Programming language" and also Shamim Mohamed's [http://www.drones.com/coexp/ article] "Co-expressions in Icon". (This topic should probably be expanded).

See also

*Unicon programming language (a descendant)

External links

* [http://www.cs.arizona.edu/icon/index.htm Icon homepage]


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • Logo (programming language) — LOGO redirects here. For other uses, see LOGO (disambiguation). Logo Paradigm(s) multi paradigm:functional educational, procedural, reflective Appeared in …   Wikipedia

  • Python (programming language) — infobox programming language name = Python paradigm = multi paradigm: object oriented, imperative, functional year = 1991 designer = Guido van Rossum developer = Python Software Foundation latest release version = 2.6 latest release date =… …   Wikipedia

  • Unicon (programming language) — Unicon is a programming language descended from Icon and its preprocessor, IDOL, that offers better access to the operating system as well as support for object oriented programming. Unicon began life as a merger of three popular Icon extensions …   Wikipedia

  • Lua (programming language) — Infobox programming language name = Lua paradigm = Multi paradigm: scripting, imperative, functional year = 1993 designer = Roberto Ierusalimschy Waldemar Celes Luiz Henrique de Figueiredo developer = latest release version = 5.1.4 latest release …   Wikipedia

  • Visual programming language — A Visual programming language (VPL) is any programming language that lets users specify programs by manipulating program elements graphically rather than by specifying them textually. A VPL allows programming with visual expressions, spatial… …   Wikipedia

  • Converge (programming language) — Converge is a dynamic object oriented programming language with compile time meta programming facilities. Developed by the British computer scientist Laurence Tratt, Converge was originally intended for implementing model transformations in the… …   Wikipedia

  • Newspeak (programming language) — Newspeak Paradigm(s) object oriented, functional Appeared in 2006 Designed by Gilad Bracha Developer Gilad Bracha, Peter von der Ahé, Vassili Bykov, Yaron Kashai, William Maddox, Eliot Miranda Stable releas …   Wikipedia

  • Clarion (programming language) — For other uses, see Clarion. Clarion Developer(s) JPI, Clarion International, SoftVelocity Stable release 8.0 / August 30, 2011; 2 months ago (2011 08 30) Operating system …   Wikipedia

  • Linoleum (programming language) — Infobox programming language name = Linoleum paradigm = procedural year = 1996Verify source|date=July 2007 designer = Alessandro Ghignola developer = Alessandro Ghignola latest release version = 1.2 latest release date = 2007Clarifyme|date=March… …   Wikipedia

  • Godiva (programming language) — Godiva (GOal DIrected jaVA) is an extension to the java programming language supporting goal directed evaluation of expressions.ee also* Unicon * Icon * ConvergeExternal links * [http://www.cs.nmsu.edu/ jeffery/godiva/ Official Godiva website] *… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”