State Encoding

- **One-hot encoding of states.**
- One FF per state.

**Ex: 3 States**

```
STATE1: 001
STATE2: 010
STATE3: 100
```

- **Why one-hot encoding?**
  - Simple design procedure.
    - Circuit matches state transition diagram (example next page).
    - Often can lead to simpler and faster “next state” and output logic.
- **Why not do this?**
  - Can be costly in terms of FFs for FSMs with large number of states.
- FPGAs are “FF rich”, therefore one-hot state machine encoding is often a good approach.
One-hot encoded FSM

- Even Parity Checker Circuit:

- In General:

  - FFs must be initialized for correct operation (only one 1)

One-hot encoded combination lock
FSM Implementation Notes

- General FSM form:

```
inputs  ->  outputs
present state  ->  next state
```

- All examples so far generate output based only on the present state:

- Commonly name **Moore Machine**
  (If output functions include both present state and input then called a **Mealy Machine**)

Finite State Machines

- **Example: Edge Detector**
  Bit are received one at a time (one per cycle), such as: 000111010 → \textit{time}
  
  Design a circuit that asserts its output for one cycle when the input bit stream changes from 0 to 1.

  Try two different solutions.
State Transition Diagram Solution A

<table>
<thead>
<tr>
<th>IN</th>
<th>PS</th>
<th>NS</th>
<th>OUT</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>00</td>
<td>00</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>00</td>
<td>01</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>01</td>
<td>00</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>01</td>
<td>11</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>11</td>
<td>00</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>11</td>
<td>11</td>
<td>0</td>
</tr>
</tbody>
</table>

ZERO

CHANGE

ONE

Solution A, circuit derivation

<table>
<thead>
<tr>
<th>IN</th>
<th>PS</th>
<th>NS</th>
<th>OUT</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>00</td>
<td>00</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>00</td>
<td>01</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>01</td>
<td>00</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>01</td>
<td>11</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>11</td>
<td>00</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>11</td>
<td>11</td>
<td>0</td>
</tr>
</tbody>
</table>

ZERO

CHANGE

ONE

\[ PS_{1} = \overline{PS_{0}} \]

\[ OUT = PS_{1}, PS_{0} \]
**Solution B**

Output depends not only on PS but also on input, IN

<table>
<thead>
<tr>
<th>IN</th>
<th>PS</th>
<th>NS</th>
<th>OUT</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
</tr>
</tbody>
</table>

Let ZERO=0, ONE=1

NS = IN, OUT = IN PS

What's the **intuition** about this solution?

---

**Edge detector timing diagrams**

- Solution A: output follows the clock
- Solution B: output changes with input rising edge and is asynchronous wrt the clock.
### FSM Comparison

**Solution A**

**Moore Machine**
- output function only of PS
- maybe more states (why?)
- synchronous outputs
  - no glitches
  - one cycle “delay”
  - full cycle of stable output

**Solution B**

**Mealy Machine**
- output function of both PS & input
- maybe fewer states
- asynchronous outputs
  - if input glitches, so does output
  - output immediately available
  - output may not be stable long enough to be useful (below):

If output of Mealy FSM goes through combinational logic before being registered, the CL might delay the signal and it could be missed by the clock edge.

---

### FSM Recap

**Moore Machine**
- input value
- inputs
- present state
- outputs
- CL
- FFs
- next state

**Mealy Machine**
- input value/output values
- inputs
- present state
- next state
- outputs
- CL
- FFs

*Both machine types allow one-hot implementations.*
Final Notes on Moore versus Mealy

1. A given state machine *could* have *both* Moore and Mealy style outputs. Nothing wrong with this, but you need to be aware of the timing differences between the two types.

2. The output timing behavior of the Moore machine can be achieved in a Mealy machine by “registering” the Mealy output values:

![Diagram of Mealy Machine]

General FSM Design Process with Verilog

Design Steps: **Implementation**

1. Specify circuit function (English)
2. Draw state transition diagram
3. Write down symbolic state transition table
4. Assign encodings (bit patterns) to symbolic states
5. Code as Verilog behavioral description
   - Use parameters to represent encoded states.
   - Use separate always blocks for register assignment and CL logic block.
   - Use case for CL block. Within each case section assign all outputs and next state value based on inputs. *Note: For Moore style machine make outputs dependent only on state not dependent on inputs.*
Counters

• Special sequential circuits (FSMs) that repeatedly sequence through a set of outputs.

• Examples:
  - binary counter: 000, 001, 010, 011, 100, 101, 110, 111, 000,
  - gray code counter:
    000, 010, 110, 100, 101, 111, 001, 000, 010, 110, ...
  - one-hot counter: 0001, 0010, 0100, 1000, 0001, 0010, ...
  - BCD counter: 0000, 0001, 0010, ..., 1001, 0000, 0001
  - pseudo-random sequence generators: 10, 01, 00, 11, 10, 01, 00, ...

• Moore machines with “ring” structure in State Transition Diagram:
What are they used?

- Counters are commonly used in hardware designs because most (if not all) computations that we put into hardware include iteration (looping). Examples:
  - Shift-and-add multiplication scheme.
  - Bit serial communication circuits (must count one "words worth" of serial bits).
- Other uses for counter:
  - Clock divider circuits
    \[ \text{16MHz} \div 64 \]
  - Systematic inspection of data-structures
    - Example: Network packet parser/filter control.
- Counters simplify “controller” design by:
  - providing a specific number of cycles of action,
  - sometimes used with a decoder to generate a sequence of timed control signals.
  - Consider using a counter when many FSM states with few branches.

Controller using Counters

- Example, Bit-serial multiplier (\(n^2\) cycles, one bit of result per \(n\) cycles):

  \[
  \text{shiftA, selectSum, shiftHI}
  \]
  \[
  \text{shiftB, shiftHI, shiftLOW, reset}
  \]

- Control Algorithm:
  \[
  \text{repeat n cycles} \{ \quad \text{// outer (i) loop} \\
  \quad \text{repeat n cycles} \{ \quad \text{// inner (j) loop} \\
  \quad \text{shiftA, selectSum, shiftHI} \\
  \}
  \]

  \[
  \text{shiftB, shiftHI, shiftLOW, reset}
  \]

Note: The occurrence of a control signal \(x\) means \(x=1\). The absence of \(x\) means \(x=0\).
Controller using Counters

- **State Transition Diagram:**
  - Assume presence of two binary counters. An “i” counter for the outer loop and “j” counter for inner loop.

TC is asserted when the counter reaches its maximum count value. CE is “count enable”. The counter increments its value on the rising edge of the clock if CE is asserted.

Controller using Counters

- **Controller circuit implementation:**

- **Outputs:**
  
  \[ \begin{align*}
  CE_i &= q_2 \\
  CE_j &= q_1 \\
  RST_i &= q_0 \\
  RST_j &= q_2 \\
  shiftA &= q_1 \\
  shiftB &= q_2 \\
  shiftLOW &= q_2 \\
  shiftHI &= q_1 + q_2 \\
  reset &= q_2 \\
  selectSUM &= q_1
  \end{align*} \]
How do we design counters?

• For binary counters (most common case) incrementer circuit would work:

• In Verilog, a counter is specified as: \( x = x+1; \)
  - This does not imply an adder
  - An incrementer is simpler than an adder
  - And a counter is simpler yet.

• In general, the best way to understand counter design is to think of them as FSMs, and follow general procedure, however some special cases can be optimized.

Synchronous Counters

All outputs change with clock edge.

• Binary Counter Design:
  
  Start with 3-bit version and generalize:

  \[
  \begin{array}{c|ccc|ccc}
  c & b & a & c' & b' & a' \\
  \hline
  0 & 0 & 0 & 0 & 0 & 1 \\
  0 & 0 & 1 & 0 & 1 & 0 \\
  0 & 1 & 0 & 0 & 1 & 1 \\
  0 & 1 & 1 & 1 & 0 & 0 \\
  1 & 0 & 0 & 1 & 0 & 1 \\
  1 & 0 & 1 & 1 & 1 & 0 \\
  1 & 1 & 0 & 1 & 1 & 1 \\
  1 & 1 & 1 & 0 & 0 & 0 \\
  \end{array}
  \]

  \[
  a' = a' \\
  b' = a + b \\
  c' = a'c + abc' + b'c \\
  = c(a' + b') + c'(ab) \\
  = c(ab)' + c'(ab) \\
  = c \oplus ab
  \]

  All outputs change with clock edge.
Synchronous Counters

- How do we extend to n-bits?
- Extrapolate \( c' = d' \oplus abc, \ e' = e \oplus abcd \)

\[
\begin{array}{c}
\text{a'} \rightarrow \text{b'} \rightarrow \text{c'} \rightarrow \text{d'} \\
\downarrow \quad \downarrow \quad \downarrow \\
\text{a} \quad \text{b} \quad \text{c} \quad \text{d}
\end{array}
\]

- Has difficulty scaling (AND gate inputs grow with n)

\[
\begin{array}{c}
\text{CE} \rightarrow \text{a'} \rightarrow \text{b'} \rightarrow \text{c'} \rightarrow \text{d'} \\
\downarrow \quad \downarrow \quad \downarrow \\
\text{a} \quad \text{b} \quad \text{c} \quad \text{d}
\end{array}
\]

- CE is “count enable”, allows external control of counting,
- TC is “terminal count”, is asserted on highest value, allows cascading, external sensing of occurrence of max value.

Synchronous Counters

- How does this one scale?
  \( \text{Delay grows } \alpha n \)
- Generation of TC signals very similar to generation of carry signals in adder.
  - “Parallel Prefix” circuit reduces delay:

\[
\begin{array}{c}
\text{a} \rightarrow \text{b} \rightarrow \text{c} \rightarrow \text{d} \\
\downarrow \quad \downarrow \quad \downarrow \\
\text{TC}_1 \quad \text{TC}_2 \quad \text{TC}_3 \quad \text{TC}_4
\end{array}
\]

\[
\begin{array}{c}
\log_2 n \\
\log_2 n
\end{array}
\]
**Up-Down Counter**

![Up-Down Counter Diagram]

Down-count

---

**Odd Counts**

- Extra combinational logic can be added to terminate count before max value is reached:
- Example: count to 12

```
reset 4-bit binary counter
        = 11 ?
```

- Alternative:

```
load 4-bit binary counter TC
```

---

Spring 2010 EECS150 - Lec22-counters Page 25
Ring Counters

- “one-hot” counters
  0001, 0010, 0100, 1000, 0001, ...

- What are these good for?

```
  reset
```

"Self-starting" version:

```
  D Q
  S R
```

Johnson Counter

(a) Four-stage switch-tail ring counter

<table>
<thead>
<tr>
<th>Sequence number</th>
<th>Flip-flop outputs</th>
<th>AND gate required for output</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>0 0 0 0</td>
<td>$A'E'$</td>
</tr>
<tr>
<td>2</td>
<td>1 0 0 0</td>
<td>$AB'$</td>
</tr>
<tr>
<td>3</td>
<td>1 1 0 0</td>
<td>$BC'$</td>
</tr>
<tr>
<td>4</td>
<td>1 1 1 0</td>
<td>$CE'$</td>
</tr>
<tr>
<td>5</td>
<td>1 1 1 1</td>
<td>$AE$</td>
</tr>
<tr>
<td>6</td>
<td>0 1 1 1</td>
<td>$A'B$</td>
</tr>
<tr>
<td>7</td>
<td>0 0 1 1</td>
<td>$B'C$</td>
</tr>
<tr>
<td>8</td>
<td>0 0 0 1</td>
<td>$CE$</td>
</tr>
</tbody>
</table>

(b) Count sequence and required decoding

Fig. 6-18  Construction of a Johnson Counter
Asynchronous “Ripple” counters

• Each stage is +2 of previous.
• Look at output waveforms:

• Often called “asynchronous” counters.
• A “T” flip-flop is a “toggle” flip-flop. Flips its state on cycles when T=1.

Forbidden in Synchronous Design

(a) With T flip-flops
(b) With D flip-flops

Fig. 6-8 4-Bit Binary Ripple Counter