Post correspondence problem

The Post correspondence problem is an undecidable decision problem that was introduced by Emil Post in 1946.^[1] Because it is simpler than the halting problem and the Entscheidungsproblem it is often used in proofs of undecidability.

1 Definition of the problem
2 Example instances of the problem
- 2.1 Example 1
- 2.2 Example 2
3 Proof sketch of undecidability
4 Variants
5 References
6 External links

Definition of the problem

The input of the problem consists of two finite lists $\alpha_{1}, \ldots, \alpha_{N}$ and $\beta_{1}, \ldots, \beta_{N}$ of words over some alphabet $A$ having at least two symbols. A solution to this problem is a sequence of indices $(i_k)_{1 \le k \le K}$ with $K \ge 1$ and $1 \le i_k \le N$ for all $k$ , such that

$\alpha_{i_1} \ldots \alpha_{i_K} = \beta_{i_1} \ldots \beta_{i_K}.$

The decision problem then is to decide whether such a solution exists or not.

Example instances of the problem

Example 1

Consider the following two lists:

α₁	α₂	α₃
a	ab	bba

β₁	β₂	β₃
baa	aa	bb

A solution to this problem would be the sequence (3, 2, 3, 1), because

α 3 α 2 α 3 α 1 = b b a + a b + b b a + a = b b a a b b b a a = b b + a a + b b + b a a = β 3 β 2 β 3 β 1 .

Furthermore, since (3, 2, 3, 1) is a solution, so are all of its "repetitions", such as (3, 2, 3, 1, 3, 2, 3, 1), etc.; that is, when a solution exists, there are infinitely many solutions of this repetitive kind.

However, if the two lists had consisted of only $α 2, α 3$ and $β 2, β 3$ , then there would have been no solution (because then no matching pair would have the same last letter, as must occur at the end of a solution).

A convenient way to view an instance of a Post correspondence problem is as a collection of blocks of the form

α_i
β_i

there being an unlimited supply of each type of block. Thus the above example is viewed as

a
baa

ab
aa

bba
bb

i = 1

i = 2

i = 3

where the solver has an endless supply of each of these three block types. A solution corresponds to some way of laying blocks next to each other so that the string in the top cells corresponds to the string in the bottom cells. Then the solution to the above example corresponds to:

bba
bb

ab
aa

bba
bb

a
baa

i₁ = 3

i₂ = 2

i₃ = 3

i₄ = 1

Example 2

Again using blocks to represent an instance of the problem, the following is an example that has infinitely many solutions in addition to the kind obtained by merely "repeating" a solution.

bb
b

ab
ba

c
bc

In this instance, every sequence of the form (1, 2, 2, ..., 2, 3) is a solution (in addition to all their repetitions):

bb
b

ab
ba

ab
ba

ab
ba

c
bc

Proof sketch of undecidability

The most common proof for the undecidability of PCP describes an instance of PCP that can simulate the computation of a Turing machine on a particular input. A match will only occur if the input would be accepted by the Turing machine. Because deciding if a Turing machine will accept an input is a basic undecidable problem, PCP cannot be decidable either. The following discussion is based on Michael Sipser's textbook Introduction to the Theory of Computation.^[2]

In more detail, the idea is that the string along the top and bottom will be a computation history of the Turing machine's computation. This means it will list a string describing the initial state, followed by a string describing the next state, and so on until it ends with a string describing an accepting state. The state strings are separated by some separator symbol (usually written #). According to the definition of a Turing machine, the full state of the machine consists of three parts:

The current contents of the tape.
The current state of the finite state machine which operates the tape head.
The current position of the tape head on the tape.

Although the tape has infinitely many cells, only some finite prefix of these will be non-blank. We write these down as part of our state. To describe the state of the finite control, we create new symbols, labelled q₁ through q_k, for each of the finite state machine's k states. We insert the correct symbol into the string describing the tape's contents at the position of the tape head, thereby indicating both the tape head's position and the current state of the finite control. For the alphabet {0,1}, a typical state might look something like:

101101110q₇00110.

A simple computation history would then look something like this:

q₀101#1q₄01#11q₂1#1q₈10.

We start out with this block, where x is the input string and q₀ is the start state:


q₀x#

The top starts out "lagging" the bottom by one state, and keeps this lag until the very end stage. Next, for each symbol a in the tape alphabet, as well as #, we have a "copy" block, which copies it unmodified from one state to the next:

a
a

We also have a block for each position transition the machine can make, showing how the tape head moves, how the finite state changes, and what happens to the surrounding symbols. For example, here the tape head is over a 0 in state 4, and then writes a 1 and moves right, changing to state 7:

q₄0
1q₇

Finally, when the top reaches an accepting state, the bottom needs a chance to finally catch up to complete the match. To allow this, we extend the computation so that once an accepting state is reached, each subsequent machine step will cause a symbol near the tape head to vanish, one at a time, until none remain. If q_f is an accepting state, we can represent this with the following transition blocks, where a is a tape alphabet symbol:

q_fa
q_f

aq_f
q_f

There are a number of details to work out, such as dealing with boundaries between states, making sure that our initial tile goes first in the match, and so on, but this shows the general idea of how a static tile puzzle can simulate a Turing machine computation.

Variants

Many variants of PCP have been considered. One reason is that, when one tries to prove undecidability of some new problem by reducing from PCP, it often happens that the first reduction one finds is not from PCP itself but from an apparently weaker version.

The condition that the alphabet $A$ have at least two symbols is required since the problem is decidable if $A$ has only one symbol.

A simple variant is to fix n, the number of tiles. This problem is decidable if n ≤ 2, but remains undecidable for n ≥ 7. It is unknown whether the problem is decidable for 3 ≤ n ≤ 6.

The circular Post correspondence problem asks whether indexes $i 1, i 2,...$ can be found such that $\alpha_{i_1} \cdots \alpha_{i_k}$ and $\beta_{i_1} \cdots \beta_{i_k}$ are conjugate words, i.e., they are equal modulo rotation. This variant is undecidable.^[3]

One of the most important variants of PCP is the bounded Post correspondence problem, which asks if we can find a match using no more than k tiles, including repeated tiles. A brute force search solves the problem in time O(2^k), but this may be difficult to improve upon, since the problem is NP-complete.^[4] Unlike some NP-complete problems like the boolean satisfiability problem, a small variation of the bounded problem was also shown to be complete for RNP, which means that it remains hard even if the inputs are chosen at random (it is hard on average over uniformly distributed inputs).^[5]

Another variant of PCP is called the marked Post Correspondence Problem, in which each u_i must begin with a different symbol, and each v_i must also begin with a different symbol. Halava, Hirvensalo, and de Wolf showed that this variation is decidable in exponential time. Moreover, they showed that if this requirement is slightly loosened so that only the two-character prefixes need to differ (the so-called 2-marked Post Correspondence Problem), the problem becomes undecidable again.^[6]

The Post Embedding Problem is another variant where one looks for indexes $i 1, i 2,...$ such that $\alpha_{i_1} \cdots \alpha_{i_k}$ is a (scattered) subword of $\beta_{i_1} \cdots \beta_{i_k}$ . This variant is easily decidable since, when some solutions exist, in particular a length-one solution exists. More interesting is the Regular Post Embedding Problem, a further variant where one looks for solutions that belong to a given regular language (submitted, e.g., under the form of a regular expression on the set ${1,..., N}$ ). The Regular Post Embedding Problem is still decidable but, because of the added regular constraint, it has a very high complexity that dominates every multiply recursive function.^[7]

References

^ E. L. Post (1946). "A variant of a recursively unsolvable problem". Bull. Amer. Math. Soc 52. http://www.ams.org/bull/1946-52-04/S0002-9904-1946-08555-9/S0002-9904-1946-08555-9.pdf.
^ Michael Sipser (2005). "A Simple Undecidable Problem". Introduction to the Theory of Computation (2nd ed.). Thomson Course Technology. pp. 199–205. ISBN 0-534-95097-3.
^ K. Ruohonen (1983). "On some variants of Post's correspondence problem". Acta Informatica (Springer) 19 (4): 357–367. doi:10.1007/BF00290732.
^ Michael R. Garey; David S. Johnson (1979). Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman. p. 228. ISBN 0-7167-1045-5.
^ Y. Gurevich (1991). "Average case completeness". J. Comp. Sys. Sci. (Elsevier Science) 42 (3): 346–398. doi:10.1016/0022-0000(91)90007-R.
^ V. Halava; M. Hirvensalo and R. de Wolf (2001). "Marked PCP is decidable". Theor. Comp. Sci. (Elsevier Science) 255: 193–204. doi:10.1016/S0304-3975(99)00163-2.
^ P. Chambart; Ph. Schnoebelen (2007). "Post embedding problem is not primitive recursive, with applications to channel systems". Lecture Notes in Computer Science. Lecture Notes in Computer Science (Springer) 4855: 265–276. doi:10.1007/978-3-540-77050-3_22. ISBN 978-3-540-77049-7.

External links

Eitan M. Gurari. An Introduction to the Theory of Computation, Chapter 4, Post's Correspondence Problem. A proof of the undecidability of PCP based on a generalization of context-free grammars.
Online PHP Based PCP Solver
PCP AT HOME

Categories:

Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

Post correspondence problem — Das Postsche Korrespondenzproblem (nach Emil Leon Post, abgekürzt auch PKP oder englisch PCP) ist ein Beispiel für ein unentscheidbares Problem in der Theoretischen Informatik. Es wird häufig verwendet, um mittels Reduktion die Unentscheidbarkeit … Deutsch Wikipedia
Correspondence problem — For the problem in theory of computation, see Post correspondence problem. The correspondence problem tries to figure out which parts of an image correspond to which parts of another image, after the camera has moved, time has elapsed, and/or the … Wikipedia
Post'sches Korrespondenzproblem — Das Postsche Korrespondenzproblem (nach Emil Leon Post, abgekürzt auch PKP oder englisch PCP) ist ein Beispiel für ein unentscheidbares Problem in der Theoretischen Informatik. Es wird häufig verwendet, um mittels Reduktion die Unentscheidbarkeit … Deutsch Wikipedia
Emil Leon Post — Infobox Scientist name = Emil Leon Post image width = birth date = February 11, 1897 birth place = Augustów, then Russian Empire death date = April 21 1954, death place = New York City, flagicon|USA U.S. residence = nationality = field =… … Wikipedia
Emil Post — Pour les articles homonymes, voir Post. Emil Leon Post Emil Leon Post (né le 11 février 1897 à Augustów et mort le 21 avril 1954 à New York) est un mathématicien … Wikipédia en Français
Undecidable problem — In computability theory and computational complexity theory, an undecidable problem is a decision problem for which it is impossible to construct an algorithm that leads to a yes or no answer the problem is not decidable.A decision problem is any … Wikipedia
Chess problem — Part of a series on Puzzles … Wikipedia
Demarcation problem — The demarcation problem (or boundary problem[1]) in the philosophy of science is about how and where to draw the lines around science. The boundaries are commonly drawn between science and non science, between science and pseudoscience, between… … Wikipedia
Co-operative Correspondence Club — The Cooperative Correspondence Club (CCC) was a group of approximately twenty four women, living all over the United Kingdom, who wrote to each other in the form of a private correspondence magazine from 1936 1990. Contents 1 Origin 2 Background… … Wikipedia
Proof of impossibility — A proof of impossibility, sometimes called a negative proof or negative result , is a proof demonstrating that a particular problem cannot be solved, or cannot be solved in general. Often proofs of impossibility have put to rest decades or… … Wikipedia

Academic Dictionaries and Encyclopedias

Post correspondence problem

Contents

Definition of the problem

Example instances of the problem

Example 1

Example 2

Proof sketch of undecidability

Variants

References

External links

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Post correspondence problem

Contents

Definition of the problem

Example instances of the problem

Example 1

Example 2

Proof sketch of undecidability

Variants

References

External links

Look at other dictionaries:

Share the article and excerpts

Direct link