Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

MIPS Instruction Set Architecture and Computer Systems Performance, Exams of Computer Architecture and Organization

Kannur University Computer Architecture and Organization

A series of problems and solutions related to computer systems performance, with a focus on the mips instruction set architecture. It covers topics such as instruction formats, addressing modes, branch conditions, and the design of a 32-bit multiply unit. It also includes a subroutine for unsigned 64-bit subtraction and a discussion on the performance of the pentium pro compared to the pentium.

Typology: Exams

2012/2013

Uploaded on 04/02/2013

shashikanth_0p3 🇮🇳

4.8

(8)

55 documents

1 / 11

This page cannot be seen from the preview

Don't miss anything!

Name: 1

Feb 21, 1996

University of California

College of Engineering

Computer Science Division -EECS

Sp 1996 D.E. Culler

CS 152 Midterm I

Your Name:________SOLUTION_______________

ID Number:_______________________________________________________

Discussion Section:__________________________________________________

You may bring two pages of notes and you may use a calculator, but no book or computer.

Please print you name clearly on the cover sheet and on every page. The point value of

each question is indicated in brackets. There are a total of 120 points. You have 170 min-

utes. Show your work. Write neatly and be well organized. It never hurts to make it easy to

grade.

Good luck.

Problem Possible Score

140

220

320

420

520

Total 120

Partial preview of the text

Download MIPS Instruction Set Architecture and Computer Systems Performance and more Exams Computer Architecture and Organization in PDF only on Docsity!

University of California

College of Engineering

Computer Science Division -EECS

Sp 1996 D.E. Culler

CS 152 Midterm I

Your Name:SOLUTION_______

ID Number:_______________________________________________________

Discussion Section:__________________________________________________

You may bring two pages of notes and you may use a calculator, but no book or computer.

Please print you name clearly on the cover sheet and on every page. The point value of

each question is indicated in brackets. There are a total of 120 points. You have 170 min-

utes. Show your work. Write neatly and be well organized. It never hurts to make it easy to

grade.

Good luck.

Problem Possible Score

Total 120

Problem 1 (40 points)

1a [3] State the five major components of a computer.

Processor datapath

Processor Control

Memory

Input

Output

1b [5] State five major distinct issues that must be addressed in an instruction set architec-

ture.

programmable storage

data types and encodings

set of operations

instruction formats

number of operands

where besides memory can operands be located

how memory operands are speified (addressing modes)

1c [2] Define Little Endian.

word is addressed by the byte address of the least significant byte (least

significant byte is at lowest address in the word.)

1d[3] Decode the following MIPS instruction using the opcode encoding table at the end

of the exam (Fig A.18) 10001100111001110000000000000111. Give its RTL (register

transfer language) meaning.

lw $7, 7($7) R[7] <– mem(R[7] + 7)

1e [3] What is the value of 1000 1100 1110 0111 0000 0000 0000 0111 as a 32bit 2s com-

plement number?

number is negative so, comp+1=> 0111 0011 0001 1000 1111 1111 1111 1001

-(2^30 + 2^29 + 2^28 + 2^25 + 2^24 + 2^20 + 2^19 + 2^16 - 7) = 1931018233

1f [3] What is the value of 10001100111001110000000000000111 as a single-precision

IEEE floating-point number?

1k[5]. Assume the NAND Gate has the following characteristics: Input load = 100fF,

propagation delay low-to-high TPlh = 0.5ns, TPhl = 0.1ns, TPlhf=0.002 ns/fF,

TPhlf=0.002ns/fF. Identify the critical path in the following cell and fully characterize it

using the linear delay model.

Input load is 100fF on A and B

TPlhf=0.002 ns/fF, TPhlf=0.002ns/fF

Fixed internal delay is a little wierd because output ends up being high, after a

glitch in one case. The important part was understanding the calculation

TPcell = (TPnand + 3 * TPfnand ) + (TPnand + 1* TPnand f) + TPnand

This gives

TPlh = (0.5 + 0.6) + (0.1 + 0.2) + 0.5 = 1.9 (or 0, since no change on output)

TPhl = (0.1 + 0.6) + (0.5 + 0.2) + 0.1 = 1.5 (glitches, then settles)

1l[3] Give the definition of speedup due an enhancement.

Speedup with E = (Time without E) / (Time with E)

= (Performance with E) / (Performance without E)

A

B

Q

Problem 2 (20 points).

This problem looks at performance and cost in the real world. The Feb 20, 1996 issue of

PC Magazine provides the following data in its “Pentium or Pro?” cover story.

CPUMark 32 and CPUMark 16 are indicators of performance (speed) similar to SPECMarks

(bigger is better) for 32 and 16 bit programs.

2a. [3] Assuming CPUmarks are indicative of performance on real programs on these

machines, how much faster is the Pentium Pro on 32 bit code? Show your work.

(a) 0.

(b) 0.

(c) 1.

(d) 1.

2b. [3] Assuming CPUmarks are indicative of performance on real programs on these

machines, how much faster is the Pentium Pro on 16bit code? Show your work.

(a) 0.

(b) 0.

(c) 1.

(d) 1.

2c. [2] How much faster in performance per dollar is the Pro on 32 and 16 bit code?

32bit:(430/7800) / ( 270/3750) = 0.76 times faster

16bit:(270/7800) / ( 276/3750) = 0.47 times faster

Pretty sad, isn’t it.

Pentium (P5) Pentium Pro (P6) Clock Rate 150 MHz 150 MHz Transistor Count 3.3 M 5.5 M Ave. System Price $3,750 $7, CPUMark 32 273 430 CPUMark 16 276 270

Speedup = Performance Pro / Performance P5= 430/

Speedup = Performance Pro / Performance P5= 270/

Problem 3. (20 points) Complete the skeleton of MIPS assembly language (with delayed

branches) below for the following C function. (Underlines may not be exactly right.)

extern int f (int);

int foo(int *A, int n) { int i = 0; int sum = 0; for (i = 0; i<n; i++) { sum = sum + f(A[i+1]); } return(sum); }

foo: subu $sp,$sp, sw $31,32($sp) sw $19,28($sp) # save A in $ sw $18,24($sp) # save n in $ sw $17,20($sp) # i in $ sw $16,16($sp) # sum in $ mov $16, $0 ; sum = 0 (2 pts for initialization) mov $17, $0 ; i = 0 slt $2, $17,$5 # i=0 < n (1 pt) beq $2,$0,$L3 # delayed branch fall-through mov $19, $4 # setup A (2 pts) mov $18, $5 # setup n $L7: # top of loop addi $4, $17, 1 # i+1 (2 pts) sll $4, $17, 2 # convert index to byte address (2 pts) addi $4, $19, $4 # &A[i+1] (2 pts) lw $4, 0($4) # fetch A[i+1] into argument register (2 pts) jal f # delayed jump and link addu $16, $16, $2 # accumulate return value into sum (2 pts) slt $2, $17, $18 # i < n (2 pts) bne $2,$0,$L7 # delayed branch to top of loop addi $17, $17, 1 # i++ $L3: # fall-through mov $2, $16 # return sum (1 pt) lw $31, 32($sp) lw $19, 28($fp) (2 pts) lw $18,24($sp) lw $17,20($sp) lw $16,16($sp) j $31 # delayed return jump addu $sp,$sp, .end foo

Problem 4 (20 points):

The Single-Cycle processor developed in class below (which was very similar to the one

in the book) supports the following instructions. (Note that, as in the virtual machine, the

branch is not delayed.)

Consider adding the following instructions to our subset: ADDIU, OR, AND, BLTZAL

(branch on less than Zero and Link). On the following pages, write the register transfers

for the new instructions. Sketch the modifications to the datapath and specify the control

points for each of the new instructions.

op | rs | rt | rd | shamt | funct = MEM[ PC ] op | rs | rt | Imm16 = inst Register Transfers ADDU R[rd] <– R[rs] + R[rt]; PC <– PC + 4 SUBU R[rd] <– R[rs] – R[rt]; PC <– PC + 4 ORi R[rt] <– R[rs] + zero_ext(Imm16); PC <– PC + 4 LOAD R[rt] <– MEM[ R[rs] + sign_ext(Imm16)]; PC <– PC + 4 STORE MEM[ R[rs] + sign_ext(Imm16)] <– R[rs]; PC <– PC + 4 BEQ if ( R[rs] == R[rt] ) then PC <– PC + sign_ext(Imm16)] || 00 else PC <– PC + 4

ALUctr

Clk

busW

RegWr

busA

busB

5 5 5

Rw Ra Rb 32 32-bit Registers

RegDst

Extender

Mux

imm16 16 32

ExtOp ALUSrc

Mux

MemtoReg

Clk

Data In

(^32) WrEnAdr

Data Memory

MemWr

ALU

Equal

Instruction<31:0>

1 0

<21:25><16:20><11:15><0:15>

Rt Rs Rd Imm

imm

Adder

Clk

00 Mux

nPC_sel

PC Ext

Adr

Inst Memory

Problem 4 (cont)

TABLE 1.

Ext ALUsrc ALUCtr MemWr Mem2Reg PC2Reg RegDst RegWr nPC ADDIU sign 1 add 0 0 0 0 1 0 OR x 0 OR 0 0 0 1 1 0 AND x 0 AND 0 0 0 1 1 0 BLTZAL x x x x x 1 2 1 LT

MIPS Instruction Set Architecture and Computer Systems Performance, Exams of Computer Architecture and Organization

Related documents

Partial preview of the text

Download MIPS Instruction Set Architecture and Computer Systems Performance and more Exams Computer Architecture and Organization in PDF only on Docsity!

University of California

College of Engineering

Computer Science Division -EECS

Sp 1996 D.E. Culler

CS 152 Midterm I

Your Name:________SOLUTION_______________

ID Number:_______________________________________________________

Discussion Section:__________________________________________________

You may bring two pages of notes and you may use a calculator, but no book or computer.

Please print you name clearly on the cover sheet and on every page. The point value of

each question is indicated in brackets. There are a total of 120 points. You have 170 min-

utes. Show your work. Write neatly and be well organized. It never hurts to make it easy to

grade.

Good luck.

Problem 1 (40 points)

1a [3] State the five major components of a computer.

Processor datapath

Processor Control

Memory

Input

Output

1b [5] State five major distinct issues that must be addressed in an instruction set architec-

ture.

programmable storage

data types and encodings

set of operations

instruction formats

number of operands

where besides memory can operands be located

how memory operands are speified (addressing modes)

1c [2] Define Little Endian.

word is addressed by the byte address of the least significant byte (least

significant byte is at lowest address in the word.)

1d[3] Decode the following MIPS instruction using the opcode encoding table at the end

of the exam (Fig A.18) 10001100111001110000000000000111. Give its RTL (register

transfer language) meaning.

lw $7, 7($7) R[7] <– mem(R[7] + 7)

1e [3] What is the value of 1000 1100 1110 0111 0000 0000 0000 0111 as a 32bit 2s com-

plement number?

number is negative so, comp+1=> 0111 0011 0001 1000 1111 1111 1111 1001

-(2^30 + 2^29 + 2^28 + 2^25 + 2^24 + 2^20 + 2^19 + 2^16 - 7) = 1931018233

1f [3] What is the value of 10001100111001110000000000000111 as a single-precision

IEEE floating-point number?

1k[5]. Assume the NAND Gate has the following characteristics: Input load = 100fF,

propagation delay low-to-high TPlh = 0.5ns, TPhl = 0.1ns, TPlhf=0.002 ns/fF,

TPhlf=0.002ns/fF. Identify the critical path in the following cell and fully characterize it

using the linear delay model.

Input load is 100fF on A and B

TPlhf=0.002 ns/fF, TPhlf=0.002ns/fF

Fixed internal delay is a little wierd because output ends up being high, after a

glitch in one case. The important part was understanding the calculation

TPcell = (TPnand + 3 * TPfnand ) + (TPnand + 1* TPnand f) + TPnand

This gives

TPlh = (0.5 + 0.6) + (0.1 + 0.2) + 0.5 = 1.9 (or 0, since no change on output)

TPhl = (0.1 + 0.6) + (0.5 + 0.2) + 0.1 = 1.5 (glitches, then settles)

1l[3] Give the definition of speedup due an enhancement.

Speedup with E = (Time without E) / (Time with E)

= (Performance with E) / (Performance without E)

A

B

Q

Problem 2 (20 points).

This problem looks at performance and cost in the real world. The Feb 20, 1996 issue of

PC Magazine provides the following data in its “Pentium or Pro?” cover story.

CPUMark 32 and CPUMark 16 are indicators of performance (speed) similar to SPECMarks

(bigger is better) for 32 and 16 bit programs.

2a. [3] Assuming CPUmarks are indicative of performance on real programs on these

machines, how much faster is the Pentium Pro on 32 bit code? Show your work.

(a) 0.

(b) 0.

(c) 1.

(d) 1.

2b. [3] Assuming CPUmarks are indicative of performance on real programs on these

machines, how much faster is the Pentium Pro on 16bit code? Show your work.

(a) 0.

(b) 0.

Your Name:SOLUTION_______