# 4B3T PAM3 PCS proposal

Tingting Zhang Huawei Technologies

1

# Overview of PCS architecture

- PCS for both 500m (long-reach mode) and motion control (low-latency mode) uses 10BASE-T1L 4B3T PAM3, achieving bounded disparity.
- The PCS overhead in both modes is 6.67%, giving the same symbol rate (80MBaud).



#### PCS transmit bit ordering (reverse order for receive)

- Long-reach mode with FEC: 1 bit is inserted every 15 65b blocks before RS (128,122,3,8) encoder. The FEC frame is scrambled before 4B3T mapping.
- Low-latency mode without FEC: 1 bit is inserted every 15 17b blocks before scrambling and 4B3T coding. Tx and Rx PCS latency is about 200ns.



# PCS block coding

- As also presented in Lo 3dg 01 012524, 64B/65B and 16B/17B with similar coding principle as 80B/81B are used for long reach and low latency application, respectively.
- Since the control octet only appears in octet 1-8 for 64B/65B (N=8), the position pointer can be shorted to 3 bits (i.e. NEXT (n) [0:2]). This allows more control characters (i.e. TD [n] [4:7]) to be transmitted.
- 16B/17B for motion control application minimizes coding latency with acceptable overhead.

| The 80B/81B block en | acoding is defined by the following pseudo-code, where $N = 10$ .          |
|----------------------|----------------------------------------------------------------------------|
|                      | II octets encoded into block. Octets numbered n = $0, 1, 2,, N-1$ .        |
|                      | irst one presented on GMII.                                                |
|                      | et n is data octet on GMII, 1 if octet n is control octet on GMII          |
| TC[-1] = 1 by def    |                                                                            |
|                      | [ octet n TXD[0:7] if TC[n] = 0                                            |
|                      | - IPG (loc_phy_ready = OK), 101 - LPI, 001 - TX Error, 000 - IPG           |
|                      | NOT OK) if $TC[n] = 1$ . $TD[n][0:4]$ is undefined.                        |
|                      | N+1 block. Bit 0 transmitted first.                                        |
| OR(n) = Bitwise C    |                                                                            |
|                      | t position of lowest bit in TC[n:N-1] that is a 1. Bit 3 is MSB.           |
| NEXT(n)[4] = 0 if    | Bitwise SUM of TC[n:N-1] = 1, else 1                                       |
| D[0]                 |                                                                            |
| B[0] =               | OR(0)                                                                      |
| B[8n+1:8n+4] =       | TD[n][0:3] - if OR(n) = 0<br>NEXT(n)[0:3] - if OR(n) = 1 AND $TC[n-1] = 1$ |
|                      | TD[n-1][3:6] - if OR(n) = 1 AND TC[n-1] = 0                                |
| D[0-15]              |                                                                            |
| B[8n+5] =            | TD[n][4] - if OR(n) = 0                                                    |
|                      | NEXT (n) [4] $-$ if OR (n) $=$ 1 AND TC [n-1] $=$ 1                        |
| D[0-16.0-10]         | TD[n-1][7] - if OR(n) = 1 AND TC[n-1] = 0                                  |
| B[8n+6:8n+8] =       |                                                                            |
|                      | TD[n][5:7] - if OR(n) = 1 AND TC[n] = 1                                    |
|                      | TD[n][0:2] - if OR(n) = 1 AND TC[n] = 0                                    |
|                      |                                                                            |

Source: Clause 97.3.2.2.4

#### **RS-FEC** encoder

- RS (128, 122) in GF(2<sup>8</sup>) field with 6 parity symbols is only used for long-reach transmission. No FEC is used for low-latency mode.
  - FEC frame consisting of 122 message and 6 parity symbols, has duration of 9.6us.
  - 976 message bits is composed of 15 65B blocks and 1 auxiliary bit. The auxiliary bit can be used for vendor-specific communication (e.g. OAM).
  - The RS encoder provides 3.79dB net coding gain (4-10\*log<sub>10</sub>128/122) and tolerates 225ns burst. The BER threshold is 2.78e-5 for a given output BER of 1e-10.



# Scrambling

- Data mode for both long-reach and low-latency PCS uses 10BASE-T1L scrambler. The generator polynomial for Master and Slave are:
  - Master:  $g_{M}(x) = 1 + x^{13} + x^{33}$
  - Slave:  $g_s(x) = 1 + x^{20} + x^{33}$
- Training mode can also employ the same scrambler by resorting to reliable synchronization.

## 4B3T line coding

- Reuse 10BASE-T1L line coding 4B3T. 3T tuple has 17 different combinations in total.
- The redundant code group COMMA (0,0,0) is not required.
- The residual 16 code groups are used to carry 4 bit information, while achieving bounded disparity.

| 3T code group |        |        |        |       |  |
|---------------|--------|--------|--------|-------|--|
| Sum 0         | Sum ±1 | Sum ±2 | Sum ±3 | Total |  |
| 7             | 6      | 3      | 1      | 17    |  |



#### Summary

- The proposed PCS for low latency and long reach uses 10BASE-T1L 4B3T PAM3 and results in the same symbol rate of 80MBaud, simplifying PMA design and reducing PHY cost.
- For 500m transmission, RS(128,122) is used after 64B/65B coding. The RS encoder provides 3.75dB NCG and 225ns burst error correction capability, ensuring reliable communication.
- For latency-sensitive application, FEC is not used. PCS transmit performs simple functions. Overall PCS latency is less than 330ns.
- Can we achieve any PHY baseline (block coding, FEC, scrambling, and line coding) from this proposal?

# Q & A