## EE466

## VLSI Design

Final Exam
Dec. 13, 2019. (3:10pm - 5:10pm)
Instructor: Dae Hyun Kim (daehyun@eecs.wsu.edu)

| Name: |  |
| :---: | :---: | :--- |
| WSU ID: |  |

* Allowed: Textbooks, cheat sheets, class notes, notebooks, calculators, watches, electronic devices.
* Not allowed: Chat apps.


## Problem \#1 (Sequential Logic, 10 points)

The left one is a positive-edge-triggered explicit-pulsed D flip-flop (epDFF). The right one is a negative-edge-triggered epDFF.


Do the two FFs have the same hold-time constraint for $\mathrm{D}=0$ ?

Do the two FFs have the same hold-time constraint for $\mathrm{D}=1$ ?

## Problem \#2 (Domino Logic, 10 points)

The following shows a three-stage domino logic for $Y=N 1 \cdot N 2 \cdot N 3$ (the inputs to the NFET networks are not shown). The sizes of all the PFETs and the inverters are fixed (constants). All the given timing constraints are also fixed.


Now, we merge the PFETs into a single PFET as follows:


1) Will it work? If no, explain why. If yes, can you compare the size of the PFET in the second schematic and the sum of the sizes of the PFETs in the first schematic?

Now, let's merge the NFETs into a single NFET as follows:

2) Will it work? If no, explain why. If yes, can you compare the size of the clock NFET in the third schematic and the sum of the sizes of the clock NFETs in the first schematic?

## Problem \#3 (Sequential Logic, 10 points)

The following shows a schematic of a D-F/F. Estimate the hold time constraint of the F/F (for example, "one inverter delay + one transmission gate delay").


## Problem \#4 (Carry Select Adder, 10 points)

The following shows a schematic of a $2 k$-bit carry select adder designed using $k$-bit parallel adders. The delay of a $k$-bit adder is $\Delta_{F A} \cdot \log _{2} k$, the delay of a $k$-bit MUX is $\Delta_{M}$, and the delay of a two-input AND (or OR) gate is $\Delta_{M}$.


We are supposed to design an $N$-bit adder using carry select adders (\# groups: $\frac{N}{k}$ ). Find $k$ minimizing the delay of the $N$-bit adder (express the optimal $k$ as a function of $N, \Delta_{M}$, and $\Delta_{F A}$ ). Notice that the worst-case delay occurs at $C_{N}$ (the final carry out) or $S_{N-1: 0}$ (the final sum). Just a small math hint: $\log _{2} k=\frac{\ln k}{\ln 2}, \frac{d(\ln x)}{d x}=\frac{1}{x}$.

## Problem \#5 (Carry Select Adder, 10 points)

To radically improve the delay of a carry-select adder, we design an $N$-bit carry-select adder as follows:


We design the $k$-bit adder using a $k$-bit parallel adder where $k$ is $2 m^{i}$ ( $i$ is an integer greater than or equal to 0 ).

The following shows the delays of the components:

- $k$-bit adder: $\Delta_{F A} \cdot \log _{2} k=\Delta_{F A} \cdot \log _{2}\left(2 m^{i}\right)=\Delta_{F A} \cdot\left(1+i \cdot \log _{2} m\right)$
- $k$-bit MUX: $\Delta_{M}$
- $\Delta_{F A}>2 * \Delta_{M}$

Express the delay of the last carry out $C_{N}$ using $N, m, \Delta_{F A}$, and $\Delta_{M}$.

## Problem \#6 (Carry Skip Adder, 20 points)

The following diagram shows a 16-bit carry-skip adder designed using 4-bit adders.


1) Show the details of the calculations in the carry skip adder for the following inputs (fill in the blanks).

$$
\begin{gathered}
A=0101010101010101 \\
B=1010101010101010 \\
C_{0}=1
\end{gathered}
$$

Step 1)
$[3: 0] g_{0}=\ldots \cdot g_{1}=\ldots \cdot g_{2}=\ldots \cdot g_{3}=\ldots \cdot p_{0}=\ldots \cdot p_{1}=\ldots \cdot p_{2}=\ldots \cdot p_{3}=\ldots$.
[7:4] $g_{4}=$ $\qquad$ -. $g_{5}=$ $\qquad$ - $g_{6}=$ $\qquad$ ._. $g_{7}=$ $\qquad$ . $p_{4}=$ $\qquad$ -. $p_{5}=$ $\qquad$ . $p_{6}=$ $\qquad$ . $p_{7}=$ $\qquad$
$[11: 8] g_{8}=$ $\qquad$ . $g_{9}=\ldots \cdot g_{10}=$ $\qquad$ $g_{11}=$ $\qquad$ . $p_{8}=\ldots \cdot p_{9}=$ $\qquad$ . $p_{10}=$ $\qquad$ - $p_{11}=$ _ . [15:12] $g_{12}=$ $\qquad$ - $g_{13}=$ $\qquad$ . $g_{14}=$ $\qquad$ . $g_{15}=$ $\qquad$ $p_{12}=\ldots \cdot p_{13}=$ $\qquad$ - $p_{14}=$ $\qquad$ -
$p_{15}=\ldots$.

Step 2)
[3:0] $g_{1: 0}=$ $\qquad$ $g_{2: 0}=\ldots \cdot p_{2: 0}=$ $\qquad$ - $g_{3: 0}=$ $\qquad$ . $p_{3: 0}=$ $\qquad$ —.
$[7: 4] g_{5: 4}=$ $\qquad$ . $p_{5: 4}=$ $\qquad$ . $g_{6: 4}=$ $\qquad$ . $p_{6: 4}=$ $\qquad$ . $g_{7: 4}=$ $\qquad$ . $p_{7: 4}=$ $\qquad$
$[11: 8] g_{9: 8}=$ $\qquad$ _. $p_{9: 8}=$ $\qquad$ - $g_{10: 8}=$ $\qquad$ . $p_{10: 8}=$ $\qquad$ $g_{11: 8}=$ $\qquad$ - $p_{11: 8}=$ $\qquad$
$[15: 12] g_{13: 12}=\ldots \cdot p_{13: 12}=\ldots . g_{14: 12}=$ $\qquad$ _. $p_{14: 12}=$ $\qquad$ . $g_{15: 12}=$ $\qquad$ - $p_{15: 12}=$ $\qquad$

Step 3)
$[3: 0] c_{1}=\ldots . \quad c_{2}=\ldots \cdot c_{3}=\ldots$.
$c_{4}=\ldots$.

Step 4)
$[3: 0] s_{0}=\ldots \cdot s_{1}=\ldots \cdot s_{2}=\ldots \cdot s_{3}=\ldots$.
$[7: 4] c_{5}=\ldots \cdot c_{6}=\ldots \cdot c_{7}=\ldots$.
$c_{8}=\ldots$.

Step 5)
$[7: 4] s_{4}=\ldots . s_{5}=\ldots \cdot s_{6}=\ldots . s_{7}=\ldots$.
$[11: 8] c_{9}=\ldots . c_{10}=\ldots \cdot c_{11}=\ldots$.
$c_{12}=$ $\qquad$

Step 6)
$[11: 8] s_{8}=\ldots . s_{9}=\ldots \cdot s_{10}=\ldots \cdot s_{11}=\ldots$.
[15:12] $c_{13}=\ldots . c_{14}=\ldots \cdot c_{15}=\ldots$.
$c_{16}=\ldots$.

Step 7)
$[15: 12] s_{12}=\ldots . s_{13}=\ldots . s_{14}=\ldots . s_{15}=\ldots$.
2) Repeat it for the following inputs.

$$
\begin{gathered}
A=0101010101010101 \\
B=1010101011101010 \\
C_{0}=1
\end{gathered}
$$

Step 1)
$[3: 0] g_{0}=\ldots \cdot g_{1}=\ldots \cdot g_{2}=\ldots \cdot g_{3}=\ldots \cdot p_{0}=\ldots \cdot p_{1}=\ldots \cdot p_{2}=\ldots \cdot p_{3}=\ldots$.
[7:4] $g_{4}=$ $\qquad$ . $g_{5}=$ $\qquad$ . $g_{6}=$ $\qquad$ . $g_{7}=$ $\qquad$ $p_{4}=$ $\qquad$ . $p_{5}=\ldots \cdot p_{6}=$ $\qquad$ - $p_{7}=$. . $[11: 8] g_{8}=\ldots \cdot g_{9}=\ldots \cdot g_{10}=\ldots \cdot g_{11}=\ldots \cdot p_{8}=\ldots \cdot p_{9}=\ldots \cdot p_{10}=\ldots \cdot p_{11}=$
$\qquad$
$\qquad$
$\qquad$ _.
$[15: 12] g_{12}=\ldots \cdot g_{13}=\ldots \cdot g_{14}=\ldots \cdot g_{15}=\ldots \cdot p_{12}=\ldots \cdot p_{13}=\ldots \cdot p_{14}=\ldots$. $p_{15}=\ldots$.

Step 2)
$[3: 0] g_{1: 0}=\ldots \cdot p_{1: 0}=\ldots \cdot g_{2: 0}=\ldots \cdot p_{2: 0}=\ldots \cdot g_{3: 0}=\ldots \cdot p_{3: 0}=\ldots$.
$[7: 4] g_{5: 4}=\ldots \cdot p_{5: 4}=\ldots \cdot g_{6: 4}=\ldots \cdot p_{6: 4}=\ldots \cdot g_{7: 4}=\ldots \cdot p_{7: 4}=\ldots$.
$[11: 8] g_{9: 8}=\ldots \cdot p_{9: 8}=\ldots \cdot g_{10: 8}=\ldots \cdot p_{10: 8}=\ldots \cdot g_{11: 8}=\ldots \cdot p_{11: 8}=\ldots$
$[15: 12] g_{13: 12}=\ldots \cdot p_{13: 12}=\ldots \cdot g_{14: 12}=\ldots \cdot p_{14: 12}=\ldots \cdot g_{15: 12}=\ldots \cdot p_{15: 12}=\ldots$.
Step 3)
$[3: 0] c_{1}=\ldots . \quad c_{2}=\ldots . c_{3}=\ldots$.
$c_{4}=\ldots$.

Step 4)
$[3: 0] s_{0}=\ldots . s_{1}=\ldots \cdot s_{2}=\ldots . s_{3}=\ldots$.
$[7: 4] c_{5}=\ldots . c_{6}=\ldots \cdot c_{7}=\ldots$.
$c_{8}=\ldots$.

Step 5)
$[7: 4] s_{4}=\ldots \cdot s_{5}=\ldots \cdot s_{6}=\ldots \cdot s_{7}=\ldots$.
$[11: 8] c_{9}=\ldots . c_{10}=\ldots . c_{11}=\ldots$.
$c_{12}=\ldots$.

Step 6)
$[11: 8] s_{8}=\ldots \cdot s_{9}=\ldots \cdot s_{10}=\ldots \cdot s_{11}=\ldots$.
[15:12] $c_{13}=\ldots . c_{14}=\ldots . c_{15}=\ldots$.
$c_{16}=\ldots$.

Step 7)
$[15: 12] s_{12}=\ldots \cdot s_{13}=\ldots \cdot s_{14}=\ldots \cdot s_{15}=\ldots$.

## Problem \#7 (Prefix Adder, 10 points)

Use the following delay values:

- AND, OR, XOR: $\Delta$
- Two-level (sum-of-product) logic: $2 \Delta$
- $g_{i}=a_{i} \cdot b_{i}, p_{i}=a_{i} \oplus b_{i}$

We are designing a 1024-bit Kogge-Stone adder.
Represent $s_{885}$ hierarchically using group-generated and group-propagated carries ( $g_{i: k}, p_{i: k}$ ) and $c_{0}$ (primary carry-in), then compute the delay to compute $s_{885}$ assuming all the primary input signals are available at time 0 (10 points).

## Problem \#8 (Prefix Adder, 10 points)

Use the following delay values:

- AND, OR, XOR: $\Delta$
- Two-level (sum-of-product) logic: $2 \Delta$
- $g_{i}=a_{i} \cdot b_{i}, p_{i}=a_{i} \oplus b_{i}$

We are designing a 1024-bit adder, which is similar to the Kogge-Stone adder. However, we will design the adder as follows.

- Step 0: Compute $g_{i}$ and $p_{i}$.
- $\quad$ Step 1: Instead of generating $g_{i: i-1}, p_{i: i-1}$ for each $i$ by merging $g_{i}, p_{i}$ and $g_{i-1}, p_{i-1}$, generate $g_{i: i-3}, p_{i: i-3}$ for each $i$ (except $i=0,1,2$. For $\mathrm{i}=1$, merge $g_{1}, p_{1}, g_{0}, p_{0}$. For $\mathrm{i}=2$, merge $g_{2}, p_{2}, \ldots, g_{0}, p_{0}$ ) by merging $g_{i}, p_{i}, g_{i-1}, p_{i-1}, g_{i-2}, p_{i-2}, g_{i-3}, p_{i-3}$.
- Step 2: Generate $g_{i: i-15}, p_{i: i-15}$ for each $i$ by merging
$g_{i: i-3}, p_{i: i-3}, g_{i-4: i-7}, p_{i-4: i-7}, g_{i-8: i-11}, p_{i-8: i-11}, g_{i-12: i-15}, p_{i-12: i-15}$. Notice that this cannot be applied to $i=0, \ldots, 14$. However, you can generate $g_{i: 0}, p_{i: 0}$ for them properly.
- $\quad$ Step 3: Generate $g_{i: i-63}, p_{i: i-63}$ for each $i$.
- Repeat.

Represent $s_{885}$ hierarchically using group-generated and group-propagated carries ( $g_{i: k}, p_{i: k}$ ) and $c_{0}$ (primary carry-in), then compute the delay to compute $s_{885}$ assuming all the primary input signals are available at time 0 (10 points).

## Problem \#9 (Carry Look-Ahead Adder, 40 points)

The max. fanout is 4 . Use the following delay values:

- AND, OR, XOR: $\Delta$
- Two-level (sum-of-product) logic: $2 \Delta$
- $g_{i}=a_{i} \cdot b_{i}, p_{i}=a_{i} \oplus b_{i}$

We are designing a 2048-bit carry look-ahead adder.

1) Represent $s_{885}$ hierarchically using group-generated and group-propagated carries ( $g_{i: k}, p_{i: k}$ ) and $c_{0}$ (primary carry-in), then compute the delay to compute $s_{885}$ assuming all the primary input signals are available at time 0 (10 points).
2) Represent $s_{2019}$ hierarchically using group-generated and group-propagated carries ( $g_{i: k}, p_{i: k}$ ) and $c_{0}$ (primary carry-in), then compute the delay to compute $s_{2019}$ assuming all the primary input signals are available at time 0 (10 points).
3) Show details of the calculations in the 16-bit carry-lookahead adder for the following inputs (show $g_{i}, p_{i}$, group generation/propagation bits, carry signals, etc.) Draw some block diagrams too for the carry-lookahead units and show the values of the signals.

$$
\begin{gathered}
A=0101010101010101 \\
B=1010101010101010 \\
C_{0}=1
\end{gathered}
$$

You can show the details like the one in Problem 6.
4) Repeat it for the following inputs.

$$
\begin{gathered}
A=0101010101010101 \\
B=1010101011101010 \\
C_{0}=1
\end{gathered}
$$

