## Solution to Problem Set \# 10 (by Farinaz Koushanfar)

10.1
a) Gate 1 :

```
If (E1=0) }->\mathrm{ pull-down: ((A1.B1) + (C1.D1))'
    pull-up: (C1'+D1').(A1'+B1')
    pull-up: B1'.D1'+A1'.C1'
    If (E1=1)-> pull-down: ((B1+D1). (A1+C1))' pull-up;
    (after simplification)-> G1 = A1'.C1'+B1'.D1'+E1'.(B1'.C1'+A1'.D1')
If (D2=0) \(\rightarrow\) pull-down: (A2.B2)'
If (D2=1) \(\rightarrow\) pull-down: (A2. (B2+C2))'
\(\rightarrow \mathrm{G} 2=\mathrm{D} 2^{\prime} .\left(\mathrm{A}^{\prime}+\mathrm{B} 2^{\prime}\right)+\mathrm{D} 2 .\left(\mathrm{A} 2^{\prime}+\left(\mathrm{B}^{\prime} . \mathrm{C} 2^{\prime}\right)\right)\)
(after simplification) \(\rightarrow \mathrm{G} 2=\mathrm{A} 2^{\prime}+\mathrm{B} 2^{\prime} .\left(\mathrm{C} 2^{\prime}+\mathrm{D} 2^{\prime}\right)\)
```

Gate 2:
b) As can be seen in the answer to part a), the functions of the pull-up and pull-down parts are complimentary, which makes sure that the pull-up and pull-down paths are not simultaneously on for any given input. Also, the pull-up circuit is all PMOS, and the pull-down is all NMOS. The complimentary feature also says that the pull-up and pull-down oaths are not "on" at the same time, so in the steady state, there is no current flowing from the source to the ground (the output is just connected to the source or to the ground).
c) For going from high to low in the output, the pull-down circuit should discharge the current. The pulldown delay of an inverter is: $\tau_{\mathrm{HL}}($ inv $)=0.69 \mathrm{R}_{\mathrm{D}} \mathrm{C}$.
Gate1: $\quad$ The minimum pull-down time occurs, when ( $\mathrm{A} 1=\mathrm{B} 1=\mathrm{C} 1=\mathrm{D} 1=1$ ), $\mathrm{R}_{\mathrm{eq}}=2 \mathrm{R}_{\mathrm{D}} \| 2 \mathrm{R}_{\mathrm{D}}=\mathrm{R}_{\mathrm{D}}$
The max pull-down is when $\mathrm{E} 1=1$ and we have a path starting at one branch, and ending at the other, for e.g. $(\mathrm{A} 1=\mathrm{E} 1=\mathrm{Cl}=1), \mathrm{R}_{\mathrm{eq}}=3 \mathrm{R}_{\mathrm{D}}$
Thus, the range $1 \leq \tau_{\mathrm{HL}}($ gate 1$) / \tau_{\mathrm{HL}}($ inv $) \leq 3$.
Gate2: $\quad$ The minimum pull-down time occurs, when (A2 $=\mathrm{B} 2=\mathrm{C} 2=\mathrm{D} 2=1$ ), $\mathrm{R}_{\mathrm{eq}}=5 / 3 \mathrm{R}_{\mathrm{D}}$ The max pull-down is when ( $\mathrm{A} 2=\mathrm{C} 2=\mathrm{D} 2=1, \mathrm{~B} 2=0$ ), $\mathrm{R}_{\text {eq }}=3 \mathrm{R}_{\mathrm{D}}$ Thus, the range $1.66 \leq \tau_{\mathrm{HL}}($ gate 2$) / \tau_{\mathrm{HL}}($ inv $) \leq 3$.
d) The pull-up delay of an inverter is: $\tau_{\mathrm{LH}}($ inv $)=0.69 \mathrm{R}_{\mathrm{U}} \mathrm{C}$

Gate1: The min pull-up happens when both pull-up branches are on ( $\mathrm{A} 1=\mathrm{B} 1=\mathrm{Cl}=\mathrm{D} 1=0$ ), $\mathrm{R}_{\mathrm{eq}}=2 \mathrm{R}_{\mathrm{U}} \| 2 \mathrm{R}_{\mathrm{U}}=\mathrm{R}_{\mathrm{U}}$
The max pull-up is when there is only on path and that path passes E1 in the pull-up circuit, e.g. $(\mathrm{A} 1=\mathrm{E} 1=\mathrm{D} 1=0, \mathrm{~B} 1=\mathrm{C} 1=1) . \mathrm{R}_{\mathrm{eq}}=3 \mathrm{R}_{\mathrm{U}}$. Thus, the range is $1 \leq \tau_{\mathrm{LH}}($ gate 1$) / \tau_{\mathrm{LH}}($ inv $) \leq 3$.
Gate2: The minimum pull-up time occurs, when (A2=B2=C2=D2=0), $\mathrm{R}_{\text {eq }}=0.6 \mathrm{R}_{\mathrm{U}}$
The max pull-up is when ( $B 2=C 2=0, A 2=D 2=1$ ), $\mathrm{R}_{\text {eq }}=2 \mathrm{R}_{\mathrm{U}}$
Thus, the range $\quad 0.6 \leq \tau_{\mathrm{HL}}($ gate 2$) / \tau_{\mathrm{HL}}($ inv $) \leq 2$.
e) The overall worst-case delay for Gatel is the $\max \left(\tau_{\mathrm{LH}}(\right.$ gate 1$), \tau_{\mathrm{HL}}($ gate 1$\left.)\right)=3 \tau_{\mathrm{LH}}($ inv $)=3 \tau_{\mathrm{HL}}$ (inv)

The overall worst-case delay for Gate 2 is the $\max \left(\tau_{\mathrm{LH}}(\right.$ gate 2$), \tau_{\mathrm{HL}}($ gate 2$\left.)\right)=3 \tau_{\mathrm{LH}}(\mathrm{inv})$.

## 10.2

a) The best (or min ) delay happens when the output of gate 2 is independent of the values of A2 (A2 is connected to output of gate 1) that happens when the circuit is pulled up with the other. So, if we assume that $\mathrm{D} 2=\mathrm{C} 2=1$ for a long time, and then the input B2 changes from 1 to 0 , the delay for pulling up the output of the gate 2 becomes: $\tau_{\mathrm{LH}}($ gate 2$)$ for ( $\mathrm{B} 2=\mathrm{C} 2=\mathrm{D} 2=0$ ), regardless of A 2 , which is $1.5 \tau_{\mathrm{LH}}$ (inv).
b) The longest propagation time is:
$\max ($ delay $) /($ inv delay $))=\operatorname{Max}\left(\tau_{\mathrm{HL}}(\right.$ gate 1$) / \tau_{\mathrm{HL}}($ inv $)+\tau_{\mathrm{LH}}($ gate 2$) / \tau_{\mathrm{LH}}($ inv $\left.)\right)=3+3=6$ (inverter delays)
Assume that $(\mathrm{A} 1=\mathrm{B} 1=\mathrm{C} 1=\mathrm{D} 1=\mathrm{E} 1=0)$, then the inputs A 1 and B 1 change from 0 to 1 , causing the worst case HL transition at the output of gate1 ( 3 inverter delays). Assume that the inputs ( $\mathrm{B} 2=\mathrm{D} 2=\mathrm{C} 2=1$ ), then
the change in A2 (that is connected to the output of gate1 and goes to zero), will cause the worst case LH transition at the output of gate 2 (3 inverters delay).

## 10.3

a) Each of the latches, consist of two stages. The first stage of the latch is transparent when the clock is low $(\Phi=0)$ and has the delay of $\left(0.69 \times 2 \times R_{D} C\right)$ or $\left(0.69 \times 2 \times R_{U} C\right)$ depending on if it is going from $H$ to $L$ or vice versa. The second stage of the latch is transparent when clock goes high, and has the delay of $\left(0.69 \times 2 \times R_{U} C\right)$ or $\left(0.69 \times 2 \times R_{D} C\right)$ depending on the transition. Overall, since the outputs of the two stages are compliment, the overall delay of a latch is: $\left(0.69 \times 2 \times R_{U} C\right)+\left(0.69 \times 2 \times R_{D} C\right)=0.69 \times 2 \times C\left(R_{U}+R_{D}\right)$, which is 4 inverter delays.
b)

c)


## 10.4

a) Once the clock goes from low to high,

- Time for the output of L0 to change $=2 \tau($ inv $)$
- Worst case (max) for the output of gate 1 to change $=3 \tau(\mathrm{inv})$
- Worst case (max) for the output of gate 2 to change $=3 \tau$ (inv)

When clock goes from high to low,

- Time for transferring the data in the mid-point of the latch $=2 \tau($ inv $)$

Total (low to high) $=8 \tau$ (inv), total (high to low) $=2 \tau$ (inv)
"Note that the total delay is dependent on the clock cycle, for all the cases we have here!"
b) For the first pipeline stage, once the clock goes from low to high,

- Time for the output of L0 to change $=2 \tau($ inv $)$
- Worst case (max) for the output of gate 1 to change $=3 \tau(\mathrm{inv})$
- Time for transferring the data in the mid-point of the latch $\mathrm{L} 1=2 \tau(\mathrm{inv})$

Total (low to high) $=5 \tau$ (inv), total (high to low) $=2 \tau$ (inv)
For the second pipeline stage, once the clock goes from low to high,

- Time for the output of L1 to change $=2 \tau($ inv $)$
- Worst case (max) for the output of gate 2 to change $=3 \tau$ (inv)
- Time for transferring the data in the mid-point of the latch $\mathrm{L} 2=2 \tau(\mathrm{inv})$

Total $($ low to high $)=5(\mathrm{inv})$, total (high to low $)=2 \tau($ inv $)$
c) The latency for the overall logic evaluation, (assuming the clock has $\tau_{\mathrm{H}}($ clock $)$ and $\tau_{\mathrm{L}}($ clock $)$ )

For lumped logic $=\tau_{\mathrm{H}}($ clock $)+\tau_{\mathrm{L}}($ clock $)=\tau_{\mathrm{H}}($ clock $)+\tau_{\mathrm{L}}($ clock $)$
For pipelined logic $=2\left(\tau_{\mathrm{H}}(\right.$ clock $)+\tau_{\mathrm{L}}($ clock $\left.)\right)$
d) The lumped circuit (using the worst case calculations from 10.4(a)):
$\min \left\{\tau_{\mathrm{H}}(\right.$ clock $\left.)\right\}=8 \tau$ (inv) $\quad \min \left\{\tau_{\mathrm{L}}(\right.$ clock $\left.)\right\}=2 \tau$ (inv)
The pipelined circuit (using the worst case calculations from 10.4(a)):

$$
\min \left\{\tau_{\mathrm{H}}(\text { clock })\right\}=5 \tau(\text { inv }) \quad \min \left\{\tau_{\mathrm{L}}(\text { clock })\right\}=2 \tau \text { (inv) }
$$

Assuming minimum clock cycles, the overall delay of the G1 and G2 (for the output to be valid at the midpoint of last latch) is more with the pipelined circuit: $2\left(\tau_{\mathrm{H}}(\right.$ clock $)+\tau_{\mathrm{L}}($ clock $\left.)\right)=14 \tau(\mathrm{inv})$ as opposed to the lumped circuit: $\tau_{\mathrm{H}}($ clock $)+\tau_{\mathrm{L}}($ clock $)=10 \tau($ inv $)$. However, if we just assume the delay of one cycle of the pipelined implementation, it has the minimum $\tau_{\mathrm{H}}($ clock $)+\tau_{\mathrm{L}}($ clock $)=7 \tau(\mathrm{inv})$, which is $3 \tau(\mathrm{inv})$ faster than the lumped case, in terms of the clock cycle.

