

















| Design Tradeoffs                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |    |  |  |  |  |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|--|--|--|--|
| <ul> <li>Memory update policy</li> <li>write-through <ul> <li>performance depends on the # of writes</li> <li>store buffer decreases this</li> <li>check on load misses</li> <li>store compression</li> </ul> </li> <li>write-back <ul> <li>performance depends on the # of dirty block replacements but</li> <li>dirty bit &amp; logic for checking it</li> <li>tag check before the write</li> <li>must flush the cache before I/O</li> <li>optimization: fetch before replace</li> </ul> </li> <li>both use a merging store buffer</li> </ul> |    |  |  |  |  |
| Spring 2010 CSE 471 - Memory Hierarchy                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 10 |  |  |  |  |











|                                                 | Design Tradeoffs                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |    |
|-------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|
| • prod<br>- synony<br>• "th<br>•<br>• a so<br>• | <pre>b flush the cache on a context switch<br/>cess identification (PID) can avoid this<br/>ms<br/>e synonym problem"<br/>if 2 processes are sharing data, two (different) virtual<br/>addresses map to the same physical address<br/>2 copies of the same data in the cache<br/>on a write, only one will be updated; so the other has old data<br/>blution: page coloring<br/>processes share segments; all shared data have the same<br/>offset from the beginning of a segment, i.e., the same low-<br/>order bits<br/>cache must be &lt;= the segment size<br/>(more precisely, each set of the cache must be &lt;= the<br/>segment size)</pre> | 3  |
| Spring 2010                                     | index taken from segment offset, tag compare on segment #<br>CSE 471 - Memory Hierarchy                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | 16 |

















| Measuring Cache Hierarchy Performance |                                               |                                                             |    |  |  |  |
|---------------------------------------|-----------------------------------------------|-------------------------------------------------------------|----|--|--|--|
| Global Mis                            | ss Ratio:                                     | globalMR = #misses in cache<br>#references generated by CPU |    |  |  |  |
| Example:                              | 1000 Referenc<br>40 L1 misses<br>10 L2 misses | es                                                          |    |  |  |  |
| global MR                             | (L1):                                         |                                                             |    |  |  |  |
| global MR                             | (L2):                                         |                                                             |    |  |  |  |
|                                       |                                               |                                                             |    |  |  |  |
| Spring 2010                           |                                               | CSE 471 - Memory Hierarchy                                  | 25 |  |  |  |

