Class
|
Topic
|
Reading
Some Appendices are located on the publisher's website
|
Milestones
|
Dynamic Branch Prediction 4/1, 4/3
|
What this class is about
|
Review of pipelining
|
Section 3.1. If the review in lecture isn't jogging your memory, consult Appendix C.1-C.4.
|
Branch prediction
|
Appendix C, pp. 26-30; Sections 3.3 and 3.9, pp. 202-208
Project report guidelines,
and sample project report
|
Memory Hierarchy 4/8
|
Review of caching
|
|
No class. 4/10
|
|
|
Memory Hierarchy 4/15
|
Advanced caching techniques
|
Section 2.2
|
Execution Cores 4/17, 4/22, 4/24
|
Superscalars and static scheduling
|
|
Overview of dynamic scheduling
|
Section 3.4 to p.170, Section 2.8
|
Tomasulo's algorithm
|
Section 3.4, p.170 to its end;
Section 3.5
|
|
R10000-style dynamic scheduling (a physical register pool)
|
The
Smith/Sohi article for superscalars in a nutshell.
In the R10000 article
read from register mapping,
p. 32, through Register files, p. 35.
|
Branch prediction homework due Friday, April 25, 10:30.
|
Midterm 1 4/29
|
Exam is 1.5 hours
|
|
|
Multiprocessors 5/1, 5/6
|
Overview of multiprocessing
|
Section 5.1
|
Cache coherency
|
Section 5.2, Section 5.4
|
No class. 5/8
|
|
|
Multiprocessors 5/13, 5/15
|
Synchronization
|
Section 5.5
|
|
Wrap-up multiprocessing
|
|
First coherency milestone due, Tuesday, May 13, before class time.
|
Consistency models Brandon Lucia, a Microsoft researcher and former CSE grad student and 471 TA
|
Consistency Primer, pgs. 2-4, 17-27, 37-47
|
|
Multithreading 5/20
|
An Overview of Multithreading and Simultaneous Multithreading
|
Section 3.5 and the SMT paper
|
|
Data Parallelism 5/22
|
GPUs, Brad Beckmann, AMD
|
Section 4.1, 4.2, 4.4
|
|
Combining ILP and TP 5/27, 5/29
|
Dataflow Machines
|
After reading them over, I don't think any of the papers on the
early dataflow machines are appropriate for classroom use. There
are no general overview papers. So just listen to the lecture.
|
|
Wavescalar architecture and implementation
|
Skim The WaveScalar Architecture and
An overview of the WaveScalar implementation.
Use this to reinforce what we discuss in lecture; don't pay much attention to any
new material covered in these papers.
|
Coherency homework due, Thursday, May 29, before class.
|
The Wave of the Future 6/3
|
FPGAs -- Silicon for Procrastinators, Andrew Putnam (Microsoft Research, formerly CSE Grad Student)
|
This paper,
"Performance and Power of Cache-Based Reconfigurable Computing", discusses much of what will be in Andrew's talk. It will be useful for your report.
|
N Wrongs Make a Right: Saving Time and Energy with Approximate Computing, Adrian Sampson (CSE Grad Student)
|
This paper,
Architecture Support for Disciplined Approximate Computing, discusses much of what will be in Adrian's talk. It will be useful for your report.
|
|
MP Scaling 6/5
|
MP Scaling for Graph Algorithms
Jacob Nelson (CSE grad student)
|
This paper,
"Cora: A Latency-Tolerant Runtime for Large-Scale Irregular Applications" is just FYI.
You're not responsible for reading it.
|
Consistency homework due, Thursday, June 5, before class. Guest lecture write-up due the same time.
|
Midterm 2 6/9 10:30
|
Exam is 2 hours
|
|