# HK386 : An x86-compatible 32bit CISC Microprocessor

C.M.Kyung, I.C.Park, S.K.Hong<sup>\*</sup>, K.S.Seong, B.S.Kong<sup>†</sup>, S.J.Lee, H.Choi

S.R.Maeng, D.T.Kim, J.S.Kim, S.H.Park and Y.J.Kang

KAIST

email: kyung@dalnara.kaist.ac.kr

Abstract— In this paper, we describe the implementation and design methodology of a microprocessor, called HK386. The microprocessor is compatible with Intel 80386 with respect to the behavior of each instruction set. As the extraction of the exact behavior of each instruction set is the single most important step in compatible chip design, we focused our effort on establishing the reliable verification strategy ensuring the complete instruction level compatibility. The HK386 was successfully designed and fabricated using 0.8 um CMOS technology.

## I. INTRODUCTION

We have developed 32-bit microprocessor called HK386 which is fully instruction-level and pin-to-pin compatible with Intel 80386. The HK386 is simply plugged into the real PC instead of Intel 80386 and run all application softwares. Maintaining the compatibility with previous generation processors saves huge effort in developing application software which forms huge market. Since the behavior of the Intel 80386 is very complex and veiled, the most time-consuming part in the development of HK386 was to verify the compatibility. We focused our effort to maintain compatibility and tried several approaches to verify compatibility with Intel 80386. In this paper, the ways we have used to verify the compatibility are described as well as the implementation method. The designed chip was fabricated with 0.8 um CMOS DLM ASIC process working up to 40MHz. The specifications of the HK386 are shown in Table I.

| Clock Frequency       | 40MHz                                   |
|-----------------------|-----------------------------------------|
| Peak Performance      | 20 MIPS                                 |
| Process Technology    | $0.8 \ \mu m \text{ CMOS DLM}$          |
| Die size              | $10\mathrm{mm}$ $	imes$ $10\mathrm{mm}$ |
| Number of Transistors | 400k                                    |
| Power                 | 1.2 W                                   |
| Development Time      | 4 years                                 |
| Package               | 132 pin PGA                             |

TABLE I The specification of the HK386

# II. ARCHITECTURE

As indicated in the block diagram of Fig.1, HK386 consists of 6 blocks; an instruction prefetch unit (FU), a decode unit (DU), a control unit (CU), an execution unit (EU), a memory management unit (MMU), and a bus interface unit (BU).



Fig. 1. Block diagram of HK386

These blocks operate in parallel and in pipelined manner to increase the performance. The FU fetches instructions and sends them to the DU. The DU translates the received instruction into a decoded form which is easier to execute, *i.e.*, generates addressing mode, operand size, and determines the start address of the control ROM for the instruction and so on. The CU has 4000 word  $\times$  40 bit microcode control ROM. Through micro-commands, the CU manages the EU and the MMU. The execution unit contains 8 general purpose registers, shifter, ALU and special hardware to efficiently handle the protected mode. The MMU consists of segmentation unit and paging unit. The segmentation unit translates effective address into the linear address, while the paging unit translates the corresponding linear address to the physical address to be sent to the BU. The BU controls all of memory accesses, and generates bus control signals required to communicate with the outside of HK386.

<sup>\*</sup>Affiliated with Design Lab.1 Semiconductor R&D Lab.2, Hyundai Electronics Industries

<sup>&</sup>lt;sup>†</sup>Affiliated with R&D Lab. 3 LG semicon

#### III. A DESIGN METHODOLOGY FOR COMPATIBLE CHIPS

In this section, we describe a design methodology that is effective in the design of compatible chips. We used cleanroom and top-down approach for the design of HK386. The most important thing in the compatible chip design is to extract the exact behavior of instruction set. Especially, x86 instruction set is complex and veiled. We analyzed the behavior of x86 instructions using instruction set analyzer(ISA) whose concept is explained in Fig. 2 (a). For a given test program, the ISA executes x86 instructions in the real PC environment and, also through the instruction set model, and then the two results are compared and analyzed if any discrepancy occurs.



Fig. 2. Instruction Set Analyzer and C-model hardware verification

Once knowledge on the behavior of the instruction set is captured, we then build an instruction set simulator(ISS) which is a collection of C-language description of each instruction. To verify the instruction level compatibility of the ISS, there are two approaches. The first approach is to construct PC-environment, which denotes all hardware components in PC except the microprocessor itself, in software model. Therefore, various programs such as operating systems, e.g. MS-DOS, Windows3.1 and Linux, and application programs can be run on the fully integrated software model. The second approach is to use the target system board, e.g., real PC hardware except microprocessor, which is shown in Fig 2(b), as it is composed of host computer, target system board and interface board. The host computer simulates the instruction set through ISS and the target system board is used as real PC environment. The target board and host computer are connected through the interface board which contains several FPGA chips which emulate the bus interface part of the microprocessor and include the interface circuit which receives and sends messages from and to host computer.

From the basis of instruction set behavior and target architecture, design specification is defined. We defined the HK386 architecture as pin-to-pin compatible with Intel 80386 and partially pipelined machine. The functional model and structural model were implemented as defined in design specification with Verilog. To guarantee the consistency between ISS and structural model, we compared the results of instruction execution of both models at every instruction boundary using interprocess communication(IPC), where ISS and structural model run in parallel, executing each instructions one by one. When the structural model completes one instruction, it sends its results to the ISS, which compares the received results with its own results and reports whether the results are consistent or not.

#### IV. IMPLEMENTATION

We used Verilog HDL and C-language as front-end tools and moved the design to the COMPASS tool which was used as back-end tools. The HK386 was fabricated in 0.8um CMOS technology and its photograph is shown in Fig. 3. There are several macro-blocks shown as regular part in chip photograph, *e.g.*, data-path of each block and ROM, which were generated with data-path synthesizer and ROM generator. The control logic was implemented using standard cell generated with logic synthesizer. The macro blocks are manually placed and standard cell and overall routing was performed by using automatic P&R tool. The HK386 chip is plugged in real PC instead of Intel 80386 and several operating systems, such as MS-DOS, windows 3.1 and Linux, and MS-DOS and windows applications were successfully run.



Fig. 3. The photograph of the HK386

### References

- J.N.Hennessy and D.A.Patterson, 'Computer Architecture A Quantitative Approach', Morgan Kaufman Publishers, 1990.
- [2] Paul Chow, 'The MIPS-X RISC Microprocessor', Kluwer Academic Publishers, 1989.
- [3] T. Ohtsuki, 'Logic Design And Simulation', Elsevier Science Publishers, 1991.
- [4] W.R. Stevens, 'Advanced Programming in the UNIX Environment', Addision-Wesley Publishing Company, 1992.
- [5] H.Busch, H.Nusser, T.Rossel, 'Formal Methods for Synthesis' in 'The Synthesis Approaches to Digital System Design', Kluwer Academic Publishers, 1992.