Abstract – This paper presents a VHDL based method for timing simulation within a VHDL logic simulator framework. This method is reminiscent to the one published in [1, 2] and enables the logic simulator to evaluate the longest and the shortest path delays of all signals in the circuit with only one run of the logic simulator. Timing simulation is performed at simulation time t=0 at the cost of a negligible increase of CPU time needed for the simulation. Results of the timing simulation of the ISCAS’85 benchmark circuits with a VHDL simulator are presented that prove that the proposed method is extremely efficient and appropriate for interactive use in the early phases of the design process where timing analysis needs to be repeated as the circuit design is optimized or refined.

1. INTRODUCTION

Circuit operating frequency is one of the most important user requirements to digital integrated circuit designer. The maximum operating frequency is determined by the delay of the longest data path in the circuit, while the minimum duration of the clock pulse is determined by the delay of the shortest data path in the circuit. Circuit delays are usually being extracted with timing analysis programs [3-5]. When timing analysis shows that timing requirements are not being met, the designer must redesign the circuit and/or perform the delay minimization techniques on critical paths [6]. To avoid circuit redesign, the designer needs to carry out maximal delay estimation as early as possible in the design process.

It is shown in [1, 2] that a versatile logic simulator can produce an early estimation of circuit delays during the logic verification of the design in quite acceptable CPU time. In [2] a method is proposed that enables logic simulator to perform timing simulation. The method evaluates the longest path propagation delay for all signals in the circuit with only one run of the logic simulator.

In this paper an efficient implementation of the enhanced method in VHDL is presented. It is based on use of composite type signals that consist of signal logic value and needed timing data. To test the performance of the proposed method, longest and shortest path delays in ISCAS’85 benchmark circuits are evaluated using a VHDL simulator. The presented results prove an excellent performance of the method.

The paper is organized as follows. In section 2 the method proposed in [2] is described in short. Section 3 deals with enhanced method implementation in VHDL. In section 4 experimental results are presented.

2. TIMING SIMULATION WITH LOGIC SIMULATOR

When a digital circuit is simulated for one specific input vector, the instant time when the first activity occurs on a signal determines the shortest path delay to that particular signal for the given input vector. The time instant when the last activity occurs on a signal determines the longest path. In order to obtain the worst-case delays of both rising and falling signal transitions the circuit has to be simulated for all input vectors. Therefore, 2^n circuit simulations have to be carried out, where n is the number of primary inputs. It is obvious that this approach is not feasible when n becomes large.

In order to evaluate the longest and the shortest path delays to all the signals in the circuit with only one simulation of the circuit, simultaneous simulation of the circuit for all input vectors is necessary. To enable logic simulators to perform such a simulation, eight additional signal attributes need to be introduced to store the information of the arrival of the transitions at the signal and of the shortest and the longest path delays to the signal [2].

Having a signal named S, these four attributes are:

- d1mn(S) - the shortest path delay of rising transition at S,
- d1mx(S) - the shortest path delay of falling transition at S,
- arr1mn(S) - rising transition of the shortest path arrival flag,
- arr1mx(S) - falling transition of the shortest path arrival flag.

It is assumed that the circuit is described and simulated at structural level of abstraction and that the minimal and maximal delays of all building blocks for both rising and falling edge are known. At the start of the simulation the circuit is stimulated with both rising and falling transitions at all primary inputs. As the transitions propagate from primary inputs towards primary outputs, the gate delays are accumulated along the paths. Signal attributes are accessed from the processes in the gate models and their values are dynamically updated. Once the circuit activity is exhausted, the shortest and the longest path delays are available in signal attributes d1mn, d1mx, d0mn and d0mx of each signal in the circuit.

The timing simulation mechanism is inserted into gate models with additional processes that monitors and updates the values of signal attributes. For a model of two-input NAND gate this process is represented by the pseudo code shown in Fig. 1.

Fig. 1: Two-input AND gate model for longest path delay estimation with logic simulator

Gate inputs are denoted a and b, gate output is denoted f, whereas gate propagation delays for rising and falling edge at the output f are denoted tr and tf respectively. Each falling transition at an input of the gate results in rising transition at the output.
its output, but rising transition at an input is capable to produce falling transition at the output only if rising transition had previously arrived at the other gate input. Delay model of arbitrary complexity can be applied, taking into account input signal slopes, loading capacitances and other parameters that influence ranges of gate delays \( t_r \) and \( t_f \). Each time a delay attribute of output signal \( f \) is changed, an event must be scheduled to signal \( f \) with zero delay in order to activate the processes performing the timing simulation in the gates driven by the NAND gate.

To implement the timing simulation algorithm in logic simulator, a generalized signal attribute modeling mechanism is proposed in [2]. It is built in AleC++ hardware description language – the input language of Alecisis simulator [7]. In AleC++ user-defined signal attributes can be accessed and updated in the processes during the simulation. The experimental results presented in [2] show that the proposed method is fast and appropriate for interactive use during logic verification.

### 3. IMPLEMENTATION IN VHDL

Although the user-defined signal attributes are available in VHDL [87], they cannot be accessed and updated from the processes during the simulation. Therefore, a different modeling mechanism is necessary for the implementation of the timing simulation in VHDL simulator. We propose the use of **composite signals of the record type**. Such a composite signal consists of logic state and the timing attributes \( arr1mn, arr1mx, arr0mn, arr0mx, d1mn, d1mx, d0mn \) and \( d0mx \). The record type DCSM_std_logic for the scalar signals can be declared in VHDL as:

```vhdl
subtype DCSM_std_logic is record
  state: std_logic;  
  d0mn: time;     
  d1mn: time;     
  arr0mn: boolean;  
  arr1mn: boolean;  
  state0: std_logic;  
  d0mx: time;     
  d1mx: time;     
  arr0mx: boolean;  
  arr1mx: boolean;  
end record DCSM_std_logic;
```

For the bus signals type DCSM_std_logic_vector can be declared as:

```vhdl
subtype DCSM_std_logic_vector is array
  (natural range <>) of DCSM_std_logic;
```

Assuming the input and output signals of the DCSM_std_logic type, the complete model of a two-input NAND gate can be described in VHDL as shown in Fig. 2. Processes labeled \( p1 \) and \( p2 \) are intended for standard logic simulation. They model the logic function and delay function of the NAND gate. Processes labeled \( p3 \) and \( p4 \) are intended for the timing simulation. They monitor the timing attributes of gate inputs \( inp1 \) and \( inp2 \) and update the timing attributes of gate output \( outp \).

Note that processes \( p3 \) and \( p4 \) are sensitive to the timing attributes of the input signals \( inp1 \) and \( inp2 \), but not to their logic states. Similarly, process \( p1 \) and \( p2 \) are sensitive to events on input signal states, but not to their timing attributes. Hence, the two types of processes are independent. Logic simulation and timing simulation can be performed separately or simultaneously.

It is also important that the changes of the timing attributes of the output signal \( outp \) are always scheduled with zero delay in processes \( p3 \) and \( p4 \). If the timing simulation is initiated at some time instant of simulation time, it will be finished at the same time instant, since the activity related to timing simulation is performed with no delay. We suggest that timing simulation should be performed in initial time instant \( t=0 \) of simulation. If no timing attribute is changed during the simulation, timing simulation is suppressed and the simulator performs standard logic simulation. Timing simulation is invoked by setting the timing attributes \( arr1mn, arr1mx, arr0mn \) and \( arr0mx \) of primary inputs to value true. If the primary inputs are grouped into a bus signal \( inp \), the timing analysis can be invoked by only one concurrent signal assignment command:

```vhdl
inp <= (others => ('0', 0 sec, 0 sec, true, true, '0', 0 sec, 0 sec, true, true));
```

Once the simulation time advances to \( t>0 \) sec, longest and shortest path delays of all signals in the circuit are available and can be logged out. Once additional process in the top-level entity can be added that waits for some small time period (for example, 1 pico second), prints the shortest and the longest path delays to the screen or file, and then suspends to the end of the simulation. An example of such a process is as follows:

```vhdl
log: process
begin
  wait for 1 ps;
  -- print the timing analysis results
  wait;
end process log;
```

```vhdl
entity nandg is
  generic
    tpd_hlmn : time := 0.95 ns;
    tpd_hlmx : time := 0.9 ns;
    tpd_hlm : time := 1.05 ns;
    tpd_hlmn : time := 0.0 sec, 0.0 sec, false, false, '0', 0.0 sec, 0.0 sec, false, false);
    tpd_hlmx : time := 0.0 sec, 0.0 sec, true, true, '0', 0.0 sec, 0.0 sec, false, false);
  port (out1 : out DCSM_std_logic := ('0', 0.0 sec, 0.0 sec, false, false, '0', 0.0 sec, 0.0 sec, false, false));
end nandg;
```

```vhdl
architecture only of nandg is
begin
  -- Logic function:
  p1: process(in1.statemn, in2.statemn)
  variable val, ex_value : std_logic := '0';
  begin
    case val is
      when '1' =>
        out1.statemn <= val after tpd_hlmn;
      when '0' =>
        out1.statemn <= val after tpd_hlm;
      when others =>
        end case;
      end if;
  end process p1;
end nandg;
```

```vhdl
p2: process(in1.statemx, in2.statemx)
variable val, ex_value : std_logic := '0';
begin
  case val is
    when '1' =>
      out1.statemx <= val after tpd_hlm;
    when others =>
      end process;
  end if;
end process p2;
```

```vhdl
variable val, ex_value : std_logic := '0';
begin
  case val is
    when '1' =>
      out1.statemx <= val after tpd_hlmn;
    when others =>
      end process;
  end if;
end process p2;
```
The sequential elements are modeled in a similar manner. For timing simulation, data inputs of sequential elements are the ending points of the signal propagation paths, whereas the outputs of the sequential elements are the starting points of the signal propagation paths. That means the sequential element must assign the value true to the timing attributes of data inputs at some time \( t > 0 \). Of output signals at time \( t = 0 \) and log out the timing attributes \( d1mn, d1mx, d0mn \) and \( d0mx \) of data inputs at some time \( t > 0 \).

For example, the model of a D-type flip-flop can be described with the VHDL code shown in Fig. 3. The delay parameters of the flip-flop are directly added to the described with the VHDL code shown in Fig. 3. The delay path ending at data input \( d \). The log file logfile is declared in a package to avoid the overleaping of writes from different instances of flip-flop dff.

```vhdl
entity dff is
generic (tr_ck_qmn: time := 0.85 ns; 
tr_ck_qmx: time := 1.05 ns; 
tr_ck_qmn: time := 0.9 ns; 
tr_ck_qmx: time := 1.05 ns; 
tr_ck_qmn: time := 0.85 ns; 
tr_ck_qmx: time := 1 ns); 
port (dck: DCM Std_logic 
=> ('0',true,0sec,false,false, '0',true,0sec,false,false); 
q: out TS Std_logic 
=> ('0',true,0sec,false,false, '0',true,0sec,false,false)); 
end dff;
architecture dff_arch of dff is
begin
--- Logic simulation:
p1: process(ck.statemn) 
variable ex_value,val: std_logic := '0'; 
begin
if (ex_value /= val) then 
val := d.statemn; 
if rising_edge(ck.statemn) then 
ex_value := val; 
case val is 
when '0' => 
end if;
end process p1;
p2: process(ck.statemmx) 
variable ex_value,val: std_logic := '0'; 
begin
if (ex_value /= val) then 
val := d.statemnx; 
if rising_edge(ck.statemnx) then 
ex_value := val; 
case val is 
when '0' => 
end if;
end process p2;
end only;
end architecture dff_arch;
end dff;
```

4. EXPERIMENTAL RESULTS

The efficiency of the proposed method is tested on ISCAS'85 benchmark circuits [9]. They are simulated with VHDL simulator ActiveHDL [10]. Timing simulation is performed at \( t=0 \) picoseconds and simulation is finished at \( t=2 \) picoseconds. To be able to easily validate the results, for this experiment all the gates in the ISCAS'85 circuits were first chosen to have delays \( tsrcm=tsrnx=tfmnn=tfmnx=1ns \). In such a case, the longest path delays of rising and falling edge of each signal are the same and match the topological level of that particular signal expressed in nanoseconds, as shown in column D, in Table 1. Since the gate delays are 1 nanosecond and the simulation is stopped after only 2 picoseconds, only timing simulation and initialisation phase of the logic simulation is done. The results of the shortests and the longest path delays analysis are also shown in Table 1, for the simulation of the same circuits which gates have following delays: \( tfmnn=0.95ns, tsrcm=1ns, tfmnx=0.9ns, trnx=1.05ns \). For each
ISCAS'85 circuit, the simulation result was a list of $d_I$ and $d_0$, minimal and maximal parameters of all primary output signals. The maximal of these values is given in column $D_{max}$, while the minimal values are given in column $D_{min}$ of Table 1 representing the delay of the longest and the shortest paths in the circuit.

### Table 1: Results of timing and logic simulation of ISCAS'85 benchmark circuits with a VHDL simulator

<table>
<thead>
<tr>
<th>Circuit</th>
<th>Number of signals</th>
<th>Number of inputs</th>
<th>Number of outputs</th>
<th>Number of gates</th>
<th>$D_{min}$ [ns]</th>
<th>$D_{max}$ [ns]</th>
<th>$D_{min}$ [ns]</th>
<th>$D_{max}$ [ns]</th>
</tr>
</thead>
<tbody>
<tr>
<td>c17</td>
<td>11</td>
<td>5</td>
<td>2</td>
<td>6</td>
<td>3</td>
<td>1.95</td>
<td>2.85</td>
<td>1.95</td>
</tr>
<tr>
<td>c432</td>
<td>36</td>
<td>7</td>
<td>160</td>
<td>17</td>
<td>1.95</td>
<td>16.5</td>
<td>1.95</td>
<td>16.5</td>
</tr>
<tr>
<td>c499</td>
<td>243</td>
<td>41</td>
<td>32</td>
<td>202</td>
<td>11</td>
<td>0.95</td>
<td>11.25</td>
<td>1</td>
</tr>
<tr>
<td>c880</td>
<td>443</td>
<td>60</td>
<td>26</td>
<td>383</td>
<td>24</td>
<td>5.8</td>
<td>23.4</td>
<td>5.9</td>
</tr>
<tr>
<td>c1355</td>
<td>587</td>
<td>41</td>
<td>32</td>
<td>546</td>
<td>24</td>
<td>2.9</td>
<td>23.55</td>
<td>2.95</td>
</tr>
<tr>
<td>c2670</td>
<td>1426</td>
<td>233</td>
<td>140</td>
<td>1193</td>
<td>32</td>
<td>4.9</td>
<td>31.65</td>
<td>4.85</td>
</tr>
<tr>
<td>c3540</td>
<td>1719</td>
<td>50</td>
<td>22</td>
<td>1669</td>
<td>47</td>
<td>13.7</td>
<td>46.2</td>
<td>13.5</td>
</tr>
<tr>
<td>c5315</td>
<td>2485</td>
<td>178</td>
<td>123</td>
<td>2307</td>
<td>49</td>
<td>3.95</td>
<td>48.3</td>
<td>3.85</td>
</tr>
<tr>
<td>c6288</td>
<td>2448</td>
<td>32</td>
<td>32</td>
<td>2416</td>
<td>124</td>
<td>4.85</td>
<td>120.9</td>
<td>4.9</td>
</tr>
<tr>
<td>c7552</td>
<td>3719</td>
<td>207</td>
<td>108</td>
<td>3512</td>
<td>43</td>
<td>5.9</td>
<td>41.85</td>
<td>5.8</td>
</tr>
</tbody>
</table>

To estimate the additional CPU time consumed by the timing simulation during the simulator run, the ISCAS'85 circuits are also simulated using standard logic package `ieee.std_logic_1164`. Signals of type `std_logic` and `std_logic_vector` were used in this experiment. Since timing simulation in the first experiment is finished at $t=2$ picoseconds, the logic simulation in the second experiment is also finished at $t=2$ picoseconds. Timing simulation does not increase significantly the total CPU time.

### 5. CONCLUSION

During the design process of digital integrated circuits it is extremely important to estimate the longest and the shortest path delays as early as possible. Design verification is one of the early phases in the design process. It is usually performed using a logic simulator to verify that the circuit meets the required logic function and a timing simulator to verify that the circuit meets the required timing specifications. The method proposed in this paper combines these two tools into one by enabling the VHDL simulator to perform the timing simulation. In this way the design process is simplified and accelerated.

The problem of introducing the timing simulation into the VHDL simulator is solved by the use of signals of record type in VHDL. Additional timing data is associated with each signal and used only when the timing simulation is requested. Eight timing attributes are added to each signal.

Extra memory for storing the timing attributes is proportional to the total number of signals in the circuit. For simulated ISCAS'85 circuits this number is less than 4000. Even if very complex circuits with millions of signals should be simulated, the required extra memory would be of the order of only tens of Mbytes. Storing memory is not a critical issue and the simulator performance should not be affected by the memory requirements.

The experimental results also indicate that extra simulation CPU time due to timing simulation is almost negligible for ISCAS'85 circuits. Timing simulation affects only the initialization phase of the simulation, whereas the rest of the simulation flow usually takes the most of the CPU time. The proposed method is simple for implementation and libraries of gate models used in semi-custom design methodologies.

### REFERENCES


### Sadržaj

Sadržaj – U ovom radu predložen je metod analize kašnjenašenjih VHDL simulaatora. Ovaj metod je nadgradnja onih objavljenih u [1, 2], i omogućava simulatore da odužim minimalna i maksimalna kašnjenja svih puteva u kolu, i to izvršavanjem samo jedne simulacije u kolu. Vremenska analiza se izvršava u trenutku $t=0$, i samo neznatno produžava procesorsko vreme potrebno za simulaciju. Da bi se verifikovala efikasnost predloženog metoda, prikazani su i rezultati simulacije ISCAS'85 benchmark kola, korišćenjem VHDL simulaatora. Ovaj metod izuzetno je pogodan u ranim fazama procesa projektovanja gde je potrebno ponavljati vremensku analizu uvek kada se kolo optimizuje ili redizajnira.

**PROCESA KAŠNJENJA LOGIČKOG PUTA KORIŠĆENJEM VHDL LOGIČKE SIMULACIJE**

Miljana Sokolović and Dejan Maksimović