Pipelined Processors - High Performance Computing - Assignment, Exercises of Computer Science

Some concepts of High Performance Computing are Cache Directory, Computer Organization, Command Interpretation, Operating System, Pipelined Processors, and Handling Interrupts. The main points of this assignment are: Pipelined, Implementation, Processor, Technique, Improves Performance, High Frequency, Instruction, Sequence, Forwarding, Bypassing.

Typology: Exercises

2012/2013

Uploaded on 04/27/2013

eha
Partial preview of the text

Module 5 (Lectures 21-24)
Pipelined processors

1. Consider the 5 stage processor pipeline that we discussed. As we observed, this pipeline can potentially speed up program execution by a factor of 5, when compared to an equivalent non-pipelined processor. What would the speed-up be considering a program for which every 5th instruction suffers a 1 cycle 'bubble' due to a hazard?

2. Consider the following MIPS 1 code fragment on the 5 stage processor pipeline that we discussed. Many of the instructions in this fragment are dependent on each other. Mark the data dependencies, labeling each by its type (RAW, WAR, etc.) and identifying those that would NOT be handled by the pipeline implementation techniques (e.g., result forwarding) that we discussed.

        LW   R7, -8(R7)
        ADD  R3, R5, R7
        SUB  R5, R3, R7
        OR   R7, R3, R7

3. The manual of a particular computer provides the following warnings for programmers regarding the processor pipeline:

   - There are two load delay slots.
   - There is one branch delay slot.
   - There must be at least 2 instructions between a floating point computation instruction and a floating point store instruction that uses the value computed by the computation operation for correct operation to occur.

   You are given the inner loop of a program below. The instructions whose mnemonics start with the letter 'F' are floating point instructions that use floating point registers F0..F31.

   Loop:   FLOAD   F0, 0(R1)
           FLOAD   F2, 0(R2)
           FADD    F4, F0, F2
           FSTORE  0(R1), F4
           ADDI    R1, R1, 8
           ADDI    R2, R2, 8
           BLE     R1, R3, Loop

   How many cycles does one iteration of the loop take in its present form, once it has been corrected with insertion of NOPs (no-operation instructions) to take into account the warnings mentioned above? Do static instruction scheduling to improve the loop as much as you can. How many cycles does each iteration now take?

4. The manual of a particular computer provides the following warnings for programmers regarding the processor pipeline:

   - There is one load delay slot.
   - There are two branch delay slots.
   - There must be at least 2 instructions between a floating point computation instruction and a floating point store instruction that uses the value computed by the computation operation for correct operation to occur.

   You are given the inner loop of a program below. The instructions whose mnemonics start with the letter 'F' are floating point instructions that use floating point registers F0..F31.

   saxpy:  ADD     R5, R1, R3
           FLOAD   F2, 0(R5)
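
The reasoning behind question 1 can be sketched as a small steady-state model. This is only an illustration under stated assumptions, not a solution taken from the course: the non-pipelined processor is assumed to spend one cycle per stage on every instruction at the same clock rate, the pipeline's fill time is ignored, and bubbles are the only source of lost cycles. The function name `pipeline_speedup` and its parameters are hypothetical.

```python
from fractions import Fraction


def pipeline_speedup(stages: int, stalls_per_instruction: Fraction) -> Fraction:
    """Steady-state speed-up of a pipeline over a non-pipelined processor.

    Assumptions (not from the assignment text):
      * the non-pipelined processor needs `stages` cycles per instruction;
      * the pipelined processor ideally completes 1 instruction per cycle;
      * each stall (bubble) cycle adds directly to the pipelined CPI;
      * pipeline fill time is ignored (long-running program).
    """
    if stages < 1:
        raise ValueError("stages must be at least 1")
    if stalls_per_instruction < 0:
        raise ValueError("stalls_per_instruction must be non-negative")

    non_pipelined_cpi = Fraction(stages)
    pipelined_cpi = 1 + Fraction(stalls_per_instruction)
    return non_pipelined_cpi / pipelined_cpi


# Ideal 5-stage pipeline: no bubbles at all.
ideal = pipeline_speedup(5, Fraction(0))

# Every 5th instruction suffers a 1-cycle bubble: 1/5 stall cycle per instruction.
with_bubbles = pipeline_speedup(5, Fraction(1, 5))

print(ideal, with_bubbles)  # prints: 5 25/6
```

Under these assumptions the average pipelined CPI rises from 1 to 1.2, so the speed-up falls from 5 to 25/6, roughly 4.17. Exact fractions are used rather than floats so the result can be compared without rounding error.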