8b/10b encoding

In telecommunications, 8b/10b is a line code that maps 8-bit words to 10-bit symbols to achieve DC balance and bounded disparity while still providing enough state changes to allow reasonable clock recovery. This means that the difference between the counts of 1s and 0s in a string of at least 20 bits is no more than two, and that there are no more than five 1s or 0s in a row. This relaxes the requirement on the lower bandwidth limit of the channel that carries the signal.
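The two properties named above, run length and disparity, can be measured on any bit string. The following Python sketch is illustrative only; the helper names `max_run_length` and `disparity` are hypothetical and do not come from any standard.

```python
from itertools import groupby


def max_run_length(bits: str) -> int:
    """Return the length of the longest run of identical bits in a '0'/'1' string."""
    return max((len(list(run)) for _, run in groupby(bits)), default=0)


def disparity(bits: str) -> int:
    """Return the number of 1s minus the number of 0s."""
    return bits.count("1") - bits.count("0")


# An example 10-bit pattern with a run of five 1s and two more 1s than 0s.
sample = "0011111010"
print(max_run_length(sample), disparity(sample))  # prints: 5 2
```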


The code was described in 1983 by Al Widmer and Peter Franaszek in the IBM Journal of Research and Development. IBM was issued a patent for the scheme the following year. IBM's patent notwithstanding, the method, implementation and goals are very similar to Group Code Recording (GCR), used by IBM in its 3400 Series 6250 cpi 9-track tape drives introduced with its System/370 in 1970, by Apple in the floppy disk controller of its Apple II series introduced in 1978, and by Commodore in the floppy disk controller of the Commodore 2040 introduced in 1979.

Technologies that use 8b/10b

Now that the IBM patent has expired, the scheme has become even more popular and is the default DC-free line code for new standards.

Among the technologies that use 8b/10b encoding are:
* PCI Express
* IEEE 1394b
* Serial ATA
* Fibre Channel
* Gigabit Ethernet (except for the twisted pair based 1000Base-T)
* InfiniBand
* Serial RapidIO
* DVI and HDMI (Transition Minimized Differential Signaling)
* DVB Asynchronous Serial Interface (ASI)
* HyperTransport
* Common Public Radio Interface (CPRI)
* USB 3.0

Digital audio

The encoding is also heavily used in digital audio applications:
* Digital Audio Tape
* Digital Compact Cassette (DCC)

A differing but related scheme is used for audio and data CDs:
* Compact Disc Eight-to-Fourteen Modulation


For 10 Gigabit Ethernet's 10GBASE-R Physical Medium Dependent (PMD) interfaces, 64b/66b encoding is used. This scheme is considerably different in design to 8b/10b encoding, but was created with similar considerations of DC balance, maximum run length, transition density and electromagnetic emission minimisation.

Note that 8b/10b is the encoding scheme, not a specific code. While many applications do use the same code, there exist some incompatible implementations; for example, Transition Minimized Differential Signaling, which also expands 8 bits to 10 bits, has some subtle differences.

How it works

As the scheme name suggests, 8 bits of data are transmitted as a 10-bit entity called a "symbol", or "character". The low 5 bits of data are encoded into a 6-bit group and the top 3 bits are encoded into a 4-bit group. These code groups are concatenated together to form the 10-bit symbol that is transmitted on the wire. The "data symbols" are often referred to as D.x.y where x ranges from 0-31 and y from 0-7. Standards using the 8b/10b encoding also define up to 12 "special symbols" (or "control characters") that can be sent in place of a "data symbol". They are often used to indicate start-of-frame, end-of-frame, link idle, skip and similar link-level conditions. At least one of them (i.e. a "comma" symbol) needs to be used to define the alignment of the 10 bit symbols. They are referred to as K.x.y and have different encodings from any of the D.x.y symbols.
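The D.x.y and K.x.y names follow directly from the split described above: x is the value of the low 5 bits and y the value of the top 3 bits. A minimal Python sketch, in which the function name `symbol_name` is hypothetical, is:

```python
def symbol_name(byte: int, control: bool = False) -> str:
    """Return the D.x.y (or K.x.y) name for an 8-bit value.

    This only derives the name. It does not check whether a K.x.y
    name is one of the control symbols a given standard allows.
    """
    if not 0 <= byte <= 0xFF:
        raise ValueError("byte must be in the range 0..255")
    x = byte & 0x1F  # low 5 bits, fed to the 5b/6b encoder
    y = byte >> 5    # top 3 bits, fed to the 3b/4b encoder
    prefix = "K" if control else "D"
    return f"{prefix}.{x}.{y}"


print(symbol_name(0x00))                # D.0.0
print(symbol_name(0xFF))                # D.31.7
print(symbol_name(0xBC, control=True))  # K.28.5
```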

Because 8b/10b encoding uses 10-bit symbols to encode 8-bit words, some of the 1024 possible codes can be excluded to guarantee a run-length limit of 5 consecutive equal bits and to guarantee that the difference between the counts of 0s and 1s in a symbol is no more than 2. Some of the 256 possible 8-bit words can be encoded in two different ways. Using these alternative encodings, the scheme is able to achieve long-term DC balance in the serial data stream. This permits the data stream to be transmitted through a channel with a high-pass characteristic, for example Ethernet's transformer-coupled unshielded twisted pair or optical receivers using automatic gain control.
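To see how much room the 10-bit space leaves, one can count the 10-bit words whose disparity is 0, +2 or -2. The Python sketch below applies only the disparity bound; it ignores the run-length limit and the split into 6-bit and 4-bit groups, so the result is an upper bound on usable codes rather than the actual code set.

```python
def disparity(word: int, width: int = 10) -> int:
    """Return (number of 1 bits) minus (number of 0 bits) in a width-bit word."""
    ones = bin(word).count("1")
    return ones - (width - ones)


# Tally all 1024 ten-bit words by disparity, keeping only -2, 0 and +2.
counts = {-2: 0, 0: 0, 2: 0}
for word in range(1 << 10):
    d = disparity(word)
    if d in counts:
        counts[d] += 1

print(counts)                # {-2: 210, 0: 252, 2: 210}
print(sum(counts.values()))  # 672
```

Even under this loose bound, fewer than 1024 words remain, yet there are still more than enough to cover the 256 data values and allow two encodings for some of them.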

The encoding is normally done entirely in hardware. Upper layers of the software stack should be "unaware" that this encoding is being used.

Encoding tables

Note that in the following tables, "A" and "a" are the least significant bit. The bits are sent low to high: a → b → c → d → e → i → f → g → h → j (i.e. the 5b/6b code followed by the 3b/4b code). This ordering guarantees that the special bit sequence in the comma codes is unique.
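Because "a" is both the least significant bit and the first bit on the wire, a symbol written in transmission order (abcdeifghj) reads reversed compared with its usual integer notation. The following Python sketch converts between the two; the function names are hypothetical.

```python
def to_int_lsb_first(tx_bits: str) -> int:
    """Pack a string given in transmission order so that bit 'a' becomes bit 0."""
    value = 0
    for position, bit in enumerate(tx_bits):
        if bit not in ("0", "1"):
            raise ValueError("tx_bits must contain only '0' and '1'")
        value |= int(bit) << position
    return value


def from_int_lsb_first(value: int, width: int = 10) -> str:
    """Unpack an integer into transmission order, emitting bit 0 ('a') first."""
    return "".join(str((value >> position) & 1) for position in range(width))


tx = "0011111010"  # an example 10-bit pattern in transmission order
print(hex(to_int_lsb_first(tx)))  # 0x17c
print(from_int_lsb_first(0x17C))  # 0011111010
```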

The net imbalance between the numbers of zero and one bits transmitted so far is tracked as the Running Disparity (RD), and any imbalance is corrected by the choice of encoding for the following symbols.

When a 6-bit or 4-bit code is used that has a non-zero disparity (count of '1' bits minus count of '0' bits, i.e. -2 or +2), the encoding chosen must be the one that toggles the running disparity. In other words, the non-zero disparity codes alternate in sign, regardless of whether they are 5b/6b or 3b/4b codes.
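The toggling rule can be expressed as a small state update that also rejects a code group whose disparity would push the running disparity the wrong way. This is a minimal Python sketch; the function name is hypothetical and the bit patterns in the example were chosen only for their disparity.

```python
def next_running_disparity(rd: int, group: str) -> int:
    """Return the running disparity after sending one 6-bit or 4-bit group.

    rd is -1 or +1. A balanced group leaves rd unchanged. A group with
    disparity +2 or -2 must have the opposite sign to rd, and toggles it.
    """
    if rd not in (-1, 1):
        raise ValueError("running disparity must be -1 or +1")
    d = group.count("1") - group.count("0")
    if d == 0:
        return rd
    if d not in (-2, 2):
        raise ValueError("group disparity must be -2, 0 or +2")
    if (d > 0) == (rd > 0):
        raise ValueError("group does not toggle the running disparity")
    return -rd


rd = -1
for group in ("100111", "1010", "011000", "1001"):
    rd = next_running_disparity(rd, group)
    print(group, rd)
# The running disparity after each group is: 1, 1, -1, -1
```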

Running Disparity

Running Disparity is a concept used in the 8b/10b encoding to keep the number of 1s and 0s that are transmitted "down the wire" roughly equal.

This scheme needs only two states for the Running Disparity, +1 and -1. It starts at -1 (Thatcher, Jonathan, "Thoughts on Gigabit Ethernet Physical", IBM, 1996-04-01, http://grouper.ieee.org/groups/802/3/z/public/presentations/mar1996/JTtgep.txt, accessed 2008-08-17).

For each 5b/6b and 3b/4b code with an unequal number of 1s and 0s, there are two bit patterns that can be used to transmit it: One with two more 1 bits and one with all bits inverted and thus two more 0s. Depending on the current running disparity of the signal, the encoding engine selects which of the two possible 6 or 4 bit sequences to send for the given data. (Obviously, if the 6 or 4 bit code has equal numbers of 1s and 0s, there is no choice to make, as the disparity would be unchanged.)
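The selection step described above can be sketched on the encoder side as follows. This Python sketch takes either of the two complementary patterns of a code and returns the one appropriate for the current running disparity. The function names are hypothetical, and the special cases covered by the table footnotes are deliberately ignored.

```python
def invert(bits: str) -> str:
    """Return the bit string with every bit flipped."""
    return bits.translate(str.maketrans("01", "10"))


def select_pattern(rd: int, pattern: str) -> tuple[str, int]:
    """Pick the pattern to send and return it with the new running disparity.

    pattern may be either of the two complementary forms of an unbalanced
    code, or the single form of a balanced code.
    """
    if rd not in (-1, 1):
        raise ValueError("running disparity must be -1 or +1")
    d = pattern.count("1") - pattern.count("0")
    if d == 0:
        return pattern, rd  # balanced: no choice to make, rd unchanged
    # Send the form whose disparity has the opposite sign to rd.
    chosen = pattern if (d > 0) == (rd < 0) else invert(pattern)
    return chosen, -rd


print(select_pattern(-1, "100111"))  # ('100111', 1)
print(select_pattern(1, "100111"))   # ('011000', -1)
print(select_pattern(1, "1010"))     # ('1010', 1)
```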

† For D.x.7, either the Primary (D.x.P7) or the Alternate (D.x.A7) encoding must be selected in order to avoid a run of five consecutive 0s or 1s when combined with the preceding 5b/6b code. Sequences of five identical bits are used in comma codes for synchronization. D.x.A7 is only used for x=17, x=18, and x=20 when RD=-1 and for x=11, x=13, and x=14 when RD=+1. With x=23, x=27, x=29, and x=30, the same code forms the control codes K.x.7. No other x.A7 code may be used, as it could produce misaligned comma sequences.
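The D.x.7 selection rule stated in this note amounts to a lookup keyed on the running disparity. A minimal Python sketch, in which the names `ALTERNATE_7_X` and `uses_alternate_7` are hypothetical, is:

```python
# Values of x for which D.x.A7 replaces D.x.P7, keyed by running disparity,
# exactly as listed in the note above.
ALTERNATE_7_X = {
    -1: frozenset({17, 18, 20}),
    +1: frozenset({11, 13, 14}),
}


def uses_alternate_7(x: int, rd: int) -> bool:
    """Return True if D.x.7 must be sent with the Alternate (A7) encoding."""
    if not 0 <= x <= 31:
        raise ValueError("x must be in the range 0..31")
    if rd not in ALTERNATE_7_X:
        raise ValueError("running disparity must be -1 or +1")
    return x in ALTERNATE_7_X[rd]


print(uses_alternate_7(17, -1))  # True
print(uses_alternate_7(17, +1))  # False
print(uses_alternate_7(11, +1))  # True
```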

‡ The alternate encoding for the K.x.y codes with disparity 0 allows K.28.1, K.28.5, and K.28.7 to be "comma" codes that contain a bit sequence that can't be found elsewhere in the data stream.

Control symbols

The control symbols within 8b/10b are 10-bit symbols that are valid sequences of bits (no more than six 1s or six 0s) but do not have a corresponding 8-bit data byte. They are used for low-level control functions. For instance, in Fibre Channel, K.28.5 is used at the beginning of four-byte sequences (called "Ordered Sets") that perform functions such as Loop Arbitration, Fill Words, Link Resets, etc.

From the 5b/6b and 3b/4b tables, it follows that the following 12 control symbols may be sent:

† Within the control symbols, K.28.1, K.28.5, and K.28.7 are "comma symbols". Comma symbols are used for synchronization (finding the alignment of the 8b/10b codes within a bit-stream). If K.28.7 is not used, the unique comma sequences 0011111 or 1100000 cannot be found at any bit position within any combination of normal codes.
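Synchronization on a comma can be sketched as a search for either comma sequence in the incoming bits, followed by slicing the stream into 10-bit symbols from that point. The Python sketch below assumes that the comma sequence begins at the first transmitted bit of its symbol and that K.28.7 is not in use; the function names and the example stream are hypothetical.

```python
COMMA_PATTERNS = ("0011111", "1100000")


def find_comma(bits: str) -> int:
    """Return the index of the earliest comma sequence, or -1 if none is found."""
    hits = []
    for pattern in COMMA_PATTERNS:
        index = bits.find(pattern)
        if index != -1:
            hits.append(index)
    return min(hits) if hits else -1


def split_symbols(bits: str, start: int) -> list[str]:
    """Slice the stream into complete 10-bit symbols beginning at start."""
    return [bits[i:i + 10] for i in range(start, len(bits) - 9, 10)]


# Two stray bits, a symbol containing a comma, then arbitrary following bits.
stream = "10" + "0011111010" + "1001110100"
start = find_comma(stream)
print(start)                         # 2
print(split_symbols(stream, start))  # ['0011111010', '1001110100']
```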

‡ If K.28.7 is allowed in the actual coding, a more complex definition of the synchronization pattern than suggested by † needs to be used, as a combination of K.28.7 with several other codes forms a false misaligned comma symbol overlapping the two codes. A sequence of multiple K.28.7 codes is not allowable in any case, as this would result in undetectable misaligned comma symbols.

K.28.7 is the only comma symbol that cannot be the result of a single bit error in the data stream.

External links

* Original paper by Franaszek and Widmer: http://domino.research.ibm.com/tchjr/journalindex.nsf/0/b4e28be4a69a153585256bfa0067f59a?OpenDocument
* US patent 4486739 (expired June 30, 2002)


Wikimedia Foundation. 2010.
