- Chomsky–Schützenberger theorem
-
In formal language theory, the Chomsky–Schützenberger theorem is a statement about the number of words of a given length generated by an unambiguous context-free grammar. The theorem provides an unexpected link between the theory of formal languages and abstract algebra. It is named after Noam Chomsky and Marcel-Paul Schützenberger.
Statement of the theorem
In order to state the theorem, we need a few notions from algebra and formal language theory.
A power series over is an infinite series of the form
with coefficients ak in . The multiplication of two formal power series f and g is defined in the expected way as the convolution of the sequences an and bn:
In particular, we write , , and so on. In analogy to algebraic numbers, a power series f(x) is called algebraic over , if there exists a finite set of polynomials each with rational coefficients such that
Having established the necessary notions, the theorem is stated as follows.
- Chomsky–Schützenberger theorem. If L is a context-free language admitting an unambiguous context-free grammar, and is the number of words of length k in L, then is a power series over that is algebraic over .
Proofs of this theorem are given by Kuich & Salomaa (1985), and by Panholzer (2005).
References
- Chomsky, Noam & Schützenberger, Marcel-Paul: "The Algebraic Theory of Context-Free Languages," in Computer Programming and Formal Systems, P. Braffort and D. Hirschberg (eds.), North Holland, pp. 118–161, 1963. Available online.
- Kuich, Werner & Salomaa, Arto: Semirings, Automata, Languages. Springer, 1985.
- Panholzer, Alois: "Gröbner Bases and the Defining Polynomial of a Context-free Grammar Generating Function." Journal of Automata, Languages and Combinatorics 10:79–97, 2005.
Categories:- Noam Chomsky
- Formal languages
- Theorems in discrete mathematics
Wikimedia Foundation. 2010.