Coprocessor and method of controlling the same

Published: 2022-04-30

The interface portion of a coprocessor is provided with a FIFO (First-In First-Out) memory (500) and means for accepting instructions in succession. Pipeline control of the instructions becomes possible in this way, and protocol means (200) associated with a microprocessor is also provided.

1. A method of controlling a coprocessor which is connected with a microprocessor, characterized in that commands and operands which are sent to said coprocessor under control of said microprocessor are fetched in a FIFO (First-In First-Out) memory (560).

2. A method of controlling a coprocessor as defined in claim 1, wherein a processing function of said coprocessor is divided so as to perform a pipeline control.

3. A method of controlling a coprocessor which is connected with a microprocessor, characterized in that a protocol is divided into a command transfer protocol and an operand transfer protocol.

4. A floating-point processor having a coprocessor, characterized in that a protocol control portion (200), a format conversion portion (201) and a floating-point execution portion (101) of said coprocessor are subjected to a pipeline control.

Full text of the specification

BACKGROUND OF THE INVENTION:

The present invention relates to a pipeline system, and more particularly to the interface system of a coprocessor which is well suited to the high-speed processing of a floating-point processor that is the coprocessor of a processor.

As stated in "IEEE MICRO," pp. 44 - 54, December 1983, by way of example, a coprocessor interface in the prior art is of a system wherein, when one instruction has ended and the next instruction has been sent, a status in the execution of the operation of the preceding instruction is brought back. Therefore, the instructions are intermittently sent to a coprocessor one by one, and the overhead of the interface is heavy.

The prior-art technique does not take the enhancement of the throughput of operations into consideration, and has the problem of a long operating time.

SUMMARY OF THE INVENTION:

An object of the present invention is to raise the operating speed of a coprocessor in such a way that the acceptance of instructions in the coprocessor is pipelined.

The object is accomplished by dividing the function of the coprocessor, especially a floating-point processor.

More specifically, the interface portion of the coprocessor is provided with a FIFO (First-In First-Out) and means for accepting instructions in succession. Besides, the pipeline control of the instructions becomes possible in this way, and protocol means associated with a microprocessor is also provided.

By disposing the FIFO in the interface portion of the coprocessor, the microprocessor can send data items until the FIFO is filled up, and further, the coprocessor has its function divided so as to be capable of the pipeline control, so that the data items are successively processed at high speed.

Therefore, the present invention can execute an operation at high speed.

BRIEF DESCRIPTION OF THE DRAWINGS:

  • Fig. 1 is a diagram of a microcomputer system which includes a coprocessor in an embodiment of the present invention;
  • Fig. 2 is a diagram of an embodiment of the blocks of a floating-point processor according to the present invention;
  • Fig. 3 is a diagram for explaining a format conversion function;
  • Fig. 4 is a timing chart of a command pipeline control according to the present invention;
  • Fig. 5 is a diagram of an embodiment of the internal blocks of a bus control unit; and
  • Figs. 6 - 11 are detailed timing charts of protocols in an embodiment of the present invention.

DETAILED DESCRIPTION OF THE EMBODIMENTS:

Now, an embodiment of the present invention will be described with reference to Fig. 1. Shown in Fig. 1 is a microcomputer system which realizes floating-point arithmetic. In order to perform the floating-point arithmetic, this system has two arithmetic circuits: a CPU (Central Processing Unit) 100 and an FPU (Floating-point Processing Unit) 101, which are preferably integrated on semiconductor substrates separate from each other but which may well be integrated on a single semiconductor substrate. Essentially, the CPU 100 processes only data of the integer type and cannot process data of the floating-point type. On the other hand, the FPU 101 can process the data of the floating-point type. A ROM (Read Only Memory) 102 stores a program in which instructions for the two arithmetic circuits of the CPU 100 and the FPU 101 coexist. However, it is the CPU 100 that can decode the instructions. Therefore, the FPU 101 is interfaced at high speed by protocols with the CPU 100. Here, the FPU 101 of such properties shall be called the "coprocessor."

A RAM (Random Access Memory) 103 holds data of the integer type and the floating-point type. Numeral 104 indicates an address bus for addresses delivered from the CPU 100. Numeral 105 indicates a data bus, and numeral 106 a control bus for controls concerning interfaces.

Next, the details of a coprocessor interface will be described by taking the floating-point coprocessor (FPU) 101 as an example.

Fig. 2 shows a block diagram of the floating-point coprocessor 101.

Numeral 200 indicates a bus control unit (BCU) which executes the high-speed coprocessor interface. Input/output signals connected to the BCU 200 are the signals CLK, D31 - D0, CS, AS, DS, R/W, DC, CDE, CPST2 - CPST0, AT2 - AT0, and A2 - A0. The signal CLK is an externally supplied clock signal which serves as the reference of the internal operations of the floating-point coprocessor. The signals D31 - D0 are data signals. In the figure, the signal CS is a chip select signal for selecting the FPU 101. The signal AS is an address strobe input signal which indicates that an address signal exists on the address bus 104. The signal DS is a data strobe input signal which indicates that a data signal exists on the data bus 105. The signal R/W is an input signal indicating the direction of data transfer; it becomes "1" when the CPU 100 reads data from an external memory, and "0" when the CPU 100 writes data thereinto. The signal DC indicates that data transfer on the data bus 105 has ended. In a case where data is transferred between the CPU 100 and the FPU 101, the signal DC is output from the FPU 101 to the CPU 100. In a case where data is transferred between the FPU 101 and the external memory, the signal DC is input from the external memory to the FPU 101. The signal CDE is a coprocessor data enable signal. Upon lapse of one clock cycle after this signal has been asserted, the FPU 101 outputs a destination operand onto the data bus 105. This signal is applied from the CPU 100 as an output timing recognition signal. The signals CPST2 - CPST0 indicate the internal operation status of the FPU 101. In addition, the signals AT2 - AT0 are output from the CPU 100 and indicate the sort of an access. The FPU 101 recognizes the type of control information to be transferred from the CPU 100 on the basis of these signals. That is, the FPU 101 operates upon receiving the signals AT2 - AT0 from the CPU 100 and returns the signals CPST2 - CPST0 indicative of its operating status to the CPU 100, thereby executing the coprocessor protocol.

Table 1 and Table 2 are the decode tables of the signals CPST2 - CPST0 and AT2 - AT0, respectively.

The signals A2 - A0 are address data signals which are output from the CPU 100.

By decoding a command as described above, the protocol between the CPU 100 and the FPU 101 is controlled and managed.

Numeral 201 indicates a format conversion unit (FCU) by which single-precision data, double-precision data and double-extended-precision data represented in an external format are converted into data in an internal format (double-extended-precision format).

Input/output data to or from the FCU 201 is transferred through the BCU 200 as well as a conversion data bus 206. An example of the operation is shown in Fig. 3. By way of example, in a case where external data stored in the BCU 200 is the single-precision data (having a sign of 1 bit, an exponent of 8 bits, and a mantissa of 23 bits), it is input to the FCU 201 through the conversion data bus 206. As shown in Fig. 3, the exponent part of the single-precision data having been input to the FCU 201 has its bias value removed and is converted into an exponent of 15 bits in conformity with the IEEE specification. In addition, the mantissa is set in such a way that the 23 bits are placed at the upper bit side and all the remaining bits are made "0." On the other hand, in a case where data is transferred from the FCU 201 to the BCU 200, the FCU 201 performs the reverse of the conversion of Fig. 3, that is, from the internal format (double-extended-precision format) into a designated external format.
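
The single-precision case of Fig. 3 can be illustrated with a small Python sketch. The text gives only the 15-bit exponent width and the left-justified 23-bit mantissa; the internal exponent bias of 16383, the 64-bit mantissa width and the explicit leading integer bit used below are assumptions borrowed from the common IEEE double-extended layout, and the function name `single_to_internal` is hypothetical. Zero and normalized numbers only are handled.

```python
import struct

SINGLE_BIAS = 127      # bias of the 8-bit single-precision exponent (IEEE 754)
EXT_BIAS = 16383       # ASSUMED bias of the 15-bit internal exponent
EXT_MANT_BITS = 64     # ASSUMED internal mantissa width, with explicit integer bit


def single_to_internal(value):
    """Return (sign, 15-bit exponent, mantissa) for a single-precision value."""
    bits = struct.unpack(">I", struct.pack(">f", value))[0]
    sign = bits >> 31
    exp8 = (bits >> 23) & 0xFF
    frac23 = bits & 0x7FFFFF
    if exp8 == 0 and frac23 == 0:
        return sign, 0, 0                      # signed zero
    if exp8 == 0 or exp8 == 0xFF:
        raise ValueError("denormals, infinities and NaNs are outside this sketch")
    # Remove the single-precision bias, then apply the assumed internal bias.
    exp15 = exp8 - SINGLE_BIAS + EXT_BIAS
    # Place the 23 fraction bits at the upper bit side (below the assumed
    # integer bit); all remaining low-order bits stay "0".
    mant = (1 << (EXT_MANT_BITS - 1)) | (frac23 << (EXT_MANT_BITS - 1 - 23))
    return sign, exp15, mant
```

For instance, `single_to_internal(-1.5)` yields sign 1, exponent 16383 and a mantissa whose two uppermost bits are set.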

Numeral 202 indicates an execution unit (EU) which executes the floating-point arithmetic. The EU 202 receives the converted data in the internal format (double-extended-precision format) from the FCU 201 through an operation data bus 207, and it executes a desired operation. It is constructed of an ALU, a register file, a barrel shifter, etc.

Numeral 203 indicates a microprogram unit (MAC) for executing a floating-point instruction, and an instruction execution code 209 decoded by and output from the MAC 203 is passed through an instruction decoder 204 into FCU control signals 211 and EU control signals 210, by which the FCU 201 and the EU 202 are respectively controlled.

Numeral 205 denotes an instruction sequencer (ISC) for controlling the MAC 203. In addition, a command to the ISC 205 is output from the BCU 200 and through an instruction bus 208.

The above description concerns the flow of data. Besides, the BCU 200 receives a command having been output from the CPU 100 and delivers it to the ISC 205. The delivered command is decoded in the ISC 205.

Fig. 4 is a timing chart for explaining the pipeline control system of the FPU 101. In this figure, the flows of the executions of the k-th command CMDk to the (k + 3)-th command CMDk+3 are shown. Besides, Fig. 5 shows the internal arrangement of the bus control unit BCU 200.

Numeral 500 denotes a FIFO (First-In First-Out) for fetching the commands and operands on the data bus 105. In operation, as illustrated in Fig. 4, the k-th command CMDk is fetched in the FIFO 500 under a state ① in the figure, the k-th source operand OPSk is fetched in the FIFO 500 under a state ②, and the k-th instruction address IARk is fetched in the FIFO 500 under a state ③. The (k + 1)-th command CMDk+1, source operand OPSk+1 and instruction address IARk+1 are successively fetched in the FIFO 500 under states ④, ⑤ and ⑥; the (k + 2)-th command CMDk+2, source operand OPSk+2 and instruction address IARk+2 are similarly fetched under states ⑦, ⑧ and ⑨; and the (k + 3)-th command CMDk+3, source operand OPSk+3 and instruction address IARk+3 are similarly fetched under states ⑩, ⑪ and ⑫. The commands, operands and instruction addresses fetched in the FIFO 500 are output to a BCU internal bus 502.

Among the data having been output to the BCU internal bus 502, the commands are input to a protocol execution portion (PTE) 501, which controls and manages protocols and supplies the CPU 100 with the signals CPST2 - CPST0 indicative of the internal operation status.

In addition, the commands are input to the instruction sequencer 205 through the instruction bus 208.

On the other hand, the operands are input to the format conversion unit (FCU) 201 through the BCU internal bus 502 as well as the conversion data bus 206. (For example, under the state ③, the k-th operand OPSk is input to the FCU 201, in which the format conversion is executed.)
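
The routing of FIFO entries described above can be modeled in a few lines of Python. This is only a behavioral sketch: the class name `BcuFifo`, the entry tags and the default depth of 4 are assumptions, since the text does not state the depth of the FIFO 500 or where instruction addresses are consumed.

```python
from collections import deque


class BcuFifo:
    """Behavioral model of the FIFO 500 inside the BCU 200 (a sketch)."""

    def __init__(self, depth=4):          # depth is an ASSUMPTION
        self.depth = depth
        self.entries = deque()

    def push(self, kind, value):
        """CPU side: returns False when the FIFO is full and the CPU must wait."""
        if len(self.entries) >= self.depth:
            return False
        self.entries.append((kind, value))
        return True

    def pop_and_route(self):
        """Put the oldest entry on the internal bus 502 and name its consumers."""
        kind, value = self.entries.popleft()
        if kind == "CMD":
            # Commands go to the protocol execution portion and, through the
            # instruction bus 208, to the instruction sequencer.
            return ["PTE 501", "ISC 205"], value
        if kind == "OPS":
            # Operands go through the conversion data bus 206 to the FCU.
            return ["FCU 201"], value
        # The consumer of instruction addresses is not stated in the text.
        return [], value
```

Because `push` succeeds until the FIFO is full, the CPU side can keep sending items without waiting for each command to finish, which is the decoupling the FIFO is meant to provide.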

Subsequently, the data subjected to the format conversion by the FCU 201 is sent to the floating-point execution unit EU 202, which then starts the execution of the k-th command CMDk. Simultaneously with the start of the execution, the fetch of the succeeding (k + 1)-th command CMDk+1 is started (state ④).

By pipeline-controlling the executions of the BCU 200, FCU 201 and EU 202 in this manner, the processed result of the (k - 1)-th command CMDk-1 can be obtained at the end of the state ③, that of the k-th command CMDk at the end of the state ⑥, that of the (k + 1)-th command CMDk+1 at the end of the state ⑨, and that of the (k + 2)-th command CMDk+2 at the end of the state ⑫. Shown in Fig. 4 is a case where the source operands are fetched from a memory and where destination operands are fetched from a register.
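
The state numbering of Fig. 4 described above follows a regular pattern, which the following Python sketch makes explicit. The function name `schedule` is hypothetical, and the three-state execution window is inferred from the stated result times rather than given directly in the text.

```python
def schedule(j):
    """States occupied by the command CMD(k+j) in the Fig. 4 description.

    j = 0 is the k-th command; state numbers match the circled numbers.
    """
    base = 3 * j
    return {
        "fetch_cmd": base + 1,            # command into the FIFO 500
        "fetch_operand": base + 2,        # source operand into the FIFO 500
        "fetch_iar": base + 3,            # instruction address into the FIFO 500
        "format_conversion": base + 3,    # operand converted in the FCU 201
        "execute": (base + 4, base + 6),  # EU 202 window (inferred)
        "result_ready_after": base + 6,   # result available at the end of this state
    }
```

With this model, the execution of CMDk starts in the same state in which CMDk+1 is fetched, so one result becomes available every three states once the pipeline is full.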

Next, Figs. 6 - 11 are diagrams elucidating the executions of protocols.

Basic protocols can be broadly classified as follows:

  • (1) Case where an operand out transfer exists, that is, where a succeeding command is accepted in synchronism with an output.
  • (2) Case of an n-term instruction or the like where a burst transfer is impossible.
  • (3) Case of a conditional branch or the like where no command is accepted until the execution of a preceding command ends.
  • (4) Case of a privileged instruction or the like where the command thereof is accepted at any time.
  • (5) Case other than the above cases (1) - (4), where the operand out transfer does not exist.

Depending upon these cases (1) - (5), the FPU 101 provides the signals CPST2 - CPST0 and executes the protocol in a bus cycle concerning the FPU 101, either subsequent to a command transfer (command transfer protocol), or when informing the CPU 100 of the grant or reservation of the start of the in/out transfer of an operand or of the occurrence of any of various exceptions (operand transfer protocol).

The command transfer protocols are divided into the following cases:

  • (1) Case where the operand out transfer does not exist.
  • (2) Case where the operand out transfer exists.
  • (3) Case of the conditional branch.
  • (4) Case of FREST.
  • (5) Case of the privileged instruction.

Table 3 lists the output statuses of the signals CPST2 - CPST0 (Table 1) in the respective cases. In the command transfer protocols, the following output statuses of the signals CPST2 - CPST0 are used:

  • (1) ACC (Accepted)/TRUE
  • (2) CERR (Command Error)
  • (3) BUSY
  • (4) EXCP (Exception)
  • (5) DTR (Data Transfer Ready)/FALSE

In each of the aforementioned cases, the output conditions of the statuses ACC, CERR, BUSY, EXCP and DTR are listed. These conditions correspond to the case of performing the pipeline control, and depend upon the statuses of preceding and current commands. Here, the "preceding command" signifies a command which has been output from the FIFO 500 of the BCU 200. In addition, the "current command" signifies a command which has been currently input to the FIFO 500.

The operand transfer protocol is executed in such a way that the FPU 101 handshakes with the CPU 100 in accordance with the DTR status of the signals CPST2 - CPST0 thereof. In the absence of an error in both the preceding command and the current command, when an operand needs to be input, the CPU 100 acknowledges and decides the DTR status of the signals CPST2 - CPST0, and thereafter, the transfer of the operand is started. Besides, the signals CPST2 - CPST0 indicate the BUSY status until the internal status of the FPU 101 gets ready to transfer the next operand.

In outputting an operand, the protocol is executed as in the operand input by the handshake of the FPU 101 with the CPU 100 in accordance with the DTR status of the signals CPST2 - CPST0. That is, the start of an operand transfer cycle is granted according to the DTR status. Thereafter, the FPU 101 delivers the BUSY status until it becomes an internal status capable of outputting the signals of the DTR status for granting the transfer of the next operand.
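
The handshake just described can be sketched from the CPU side in Python. The statuses are modeled by name only, because their CPST2 - CPST0 bit encodings belong to Table 1; the function name `transfer_operands` and the log strings are hypothetical.

```python
from enum import Enum


class Status(Enum):
    BUSY = "BUSY"
    DTR = "DTR"    # Data Transfer Ready


def transfer_operands(status_stream, operands):
    """Start each operand transfer only after the FPU has reported DTR."""
    log = []
    statuses = iter(status_stream)
    for operand in operands:
        for status in statuses:
            if status is Status.DTR:
                break                      # transfer granted
            log.append("wait")             # BUSY: the FPU is not ready yet
        else:
            raise RuntimeError("FPU never reported DTR")
        log.append(f"transfer {operand}")
    return log
```

Each DTR grants exactly one operand transfer cycle; every BUSY sampled before it simply delays the CPU.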

Now, the timing charts of protocols are shown in Figs. 6 - 11. Timings differ between an operation mode and an addressing mode.

Fig. 6 is the detailed timing chart of the monadic and dyadic operations between a register (source) and a register (destination).

First, a command CMD on the data bus 105 is fetched in the FIFO 500 of the BCU 200. While an instruction address IAR is further fetched, the BCU 200 decodes the protocol of the command and generates a command transfer protocol in the protocol signals CPST2 - CPST0 in accordance with conditions listed in Table 3.

Also, Fig. 6 is the detailed timing chart of a branch instruction. In the operation, a branch command CMD on the data bus 105 is fetched in the FIFO 500 of the BCU 200. While an instruction address IAR is further fetched, the BCU 200 decodes a branch condition, to generate TRUE (ACC) if the condition is true and FALSE (DTR) if it is false, in the protocol signals CPST2 - CPST0 as indicated in Table 3.

  • Fig. 7 is the detailed timing chart of the monadic and dyadic operations between a memory (source) and a register (destination). Since, in this case, the operand(s) of the source is the data of the memory, the source operand(s) on the data bus 105 needs to be fetched. In the operation, a command CMD on the data bus 105 is fetched in the FIFO 500 of the BCU 200. Thereafter, as many data items as the size of the operand(s) requires (single precision S, integer I, double precision D or double extended precision X) are fetched in the FIFO 500 within the BCU 200. Besides, a command transfer protocol is generated as the protocol signals CPST2 - CPST0 in accordance with Table 3. After all the operands of the source have been transferred, an instruction address IAR is fetched in the BCU 200.
  • Fig. 8 is the detailed timing chart of the monadic operation between a memory (source) and a memory (destination) and the dyadic operation between a register (first source) as well as a memory (second source) and a memory (destination). In this case, the operands of the sources are the data of the memory, and the operands of the destination are stored into the memory, so that the bus cycles of the operands arise twice. In the operation, a command CMD on the data bus 105 is fetched in the FIFO 500 within the BCU 200. Thereafter, in the same manner as in Fig. 7, as many data items as the size of the operands requires (single precision S, integer I, double precision D or double extended precision X) are fetched in the FIFO 500 within the BCU 200. Besides, a command transfer protocol is generated as the protocol signals CPST2 - CPST0. Since the destination is the memory, the BUSY status is output as the signals CPST2 - CPST0 till the end of the operations. When, upon the end of the operations, the FPU 101 has got ready to transfer data (in accordance with an operand transfer protocol), the status DTR (Data Transfer Ready) is output, and the operand data of the destination is transferred to the memory. Also in this case, as many data items as the size of the operands requires (single precision S, integer I, double precision D or double extended precision X) are transferred from the FIFO 500 within the BCU 200.
  • Fig. 9 is the detailed timing chart of the dyadic operation between a memory (source) and a memory (destination). In this case, the two operands of the source are the data of the memory, and the operand of the destination is also stored into the memory, so that the bus cycles of the operands arise three times. In the operation, a command CMD on the data bus 105 is fetched in the FIFO 500 of the BCU 200. Thereafter, in the same manner as in Fig. 7, source operand #1 and source operand #2 are fetched in the FIFO 500 within the BCU 200, each as many data items as the size of the operands requires (single precision S, integer I, double precision D or double extended precision X). Further, an instruction address IAR is fetched. Also in this case, as in Fig. 8, the BUSY status continues to be output as the protocol signals CPST2 - CPST0 till the end of the operation. When, upon the end of the operation, the FPU 101 has got ready to transfer data, the status DTR (Data Transfer Ready) is output according to an operand transfer protocol so as to transfer the operand data of the destination to the memory. Also in this case, as many data items as the size of the operands requires (single precision S, integer I, double precision D or double extended precision X) are transferred from the FIFO 500 within the BCU 200.
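
How the operand size and the addressing case combine into a number of data-bus transfers can be sketched as follows. The 32-bit bus width follows from the data signals D31 - D0; the operand widths in `OPERAND_BITS` (in particular a 32-bit integer and an 80-bit double extended operand) are assumptions, and all names are hypothetical.

```python
BUS_WIDTH_BITS = 32   # data bus D31 - D0

# Operand widths in bits. S and D follow IEEE single/double precision;
# the integer (I) and double-extended (X) widths are ASSUMPTIONS.
OPERAND_BITS = {"S": 32, "I": 32, "D": 64, "X": 80}

# Number of operand bus-cycle groups per case, as described for Figs. 6 - 9.
OPERAND_PHASES = {
    "register to register (Fig. 6)": 0,
    "memory to register (Fig. 7)": 1,
    "one memory source, memory destination (Fig. 8)": 2,
    "two memory sources, memory destination (Fig. 9)": 3,
}


def bus_transfers(size):
    """Data items needed to move one operand of the given size (ceiling division)."""
    return -(-OPERAND_BITS[size] // BUS_WIDTH_BITS)


def total_operand_transfers(case, size):
    """Operand data items on the bus for one instruction; CMD and IAR excluded."""
    return OPERAND_PHASES[case] * bus_transfers(size)
```

Under these assumptions a double extended operand needs three data items, and a double-precision dyadic operation with two memory sources and a memory destination moves six data items in total.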

The FPU 101 has data transfer instructions besides operation instructions in the monadic and dyadic forms, etc. The data transfer instructions include an instruction of data transfer (operand in transfer) from the memory to the register of the FPU and an instruction of data transfer (operand out transfer) from the register of the FPU to the memory.

Fig. 10 shows the detailed timing chart of a large number of operand in-transfer instructions, while Fig. 11 shows the detailed timing chart of an operand out-transfer instruction.

Regarding the n operand in-transfer instructions in Fig. 10, in the same manner as in Fig. 7, as many data items as the size of the source operands requires (single precision S, integer I, double precision D or double extended precision X) are fetched in the FIFO 500 within the BCU 200, while data items are sent to the FCU 201. In the operation, after a command CMD on the data bus 105 is fetched in the FIFO 500 within the BCU 200, the first source operand #1 is fetched, and thereafter, the protocol signals CPST2 - CPST0 output the status DTR (Data Transfer Ready) in accordance with an operand transfer protocol. In the illustrated case, there are n source operands, so that the signals CPST2 - CPST0 output the statuses BUSY and DTR alternately until the transfer of source operand #n ends.

Fig. 11 illustrates one operand out-transfer instruction. In the operation, a command CMD on the data bus 105 is fetched in the FIFO 500 within the BCU 200. Thereafter, the BCU 200 fetches an instruction address IAR because a source operand resides in the register of the FPU 101.

At that time, the protocol signals CPST2 - CPST0 output a command transfer protocol in accordance with Table 3. Thereafter, the status BUSY is output until the FPU 101 gets ready to transfer an operand. When the FPU has got ready for the transfer, the signals CPST2 - CPST0 output the status DTR being an operand transfer protocol, and operand data items OPD1 - OPD3 are delivered onto the data bus 105.

Incidentally, although one bus cycle is performed by three clock pulses in the drawings, this is not restrictive.

According to the present invention, floating-point arithmetic is functionally divided and is pipeline-controlled, whereby a command pipeline can be realized, so that the overhead of an interface in the floating-point arithmetic can be lessened, which is effective to raise an operating speed.

Moreover, protocols are defined between a microprocessor and a floating-point processor, so that disposal at the occurrence of an error is facilitated.

The protocols are applicable to coprocessors other than the floating-point processor.
