





| Shared Caches                                                                                                                                                                                                                                                              |   |  |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---|--|
| <ul> <li>Share low level caches among multiple processors         <ul> <li>Sharing L1 adds to latency, <i>unless</i> multithreaded processor</li> </ul> </li> <li>Advantages</li> </ul>                                                                                    | 5 |  |
| <ul> <li>Eliminates need for coherence protocol at shared level</li> <li>Reduces latency within sharing group</li> <li>Processors essentially prefetch for each other</li> <li>Can exploit working set sharing</li> <li>Increases utilization of cache bardware</li> </ul> |   |  |
| <ul> <li>Disadvantages         <ul> <li>Higher bandwidth requirements</li> <li>Increased hit latency</li> <li>May be more complex design</li> <li>Lower effective capacity if working sets don't overlap</li> </ul> </li> </ul>                                            |   |  |
| <ul> <li>Bottom Line         <ul> <li>Packaging has a lot to do with it</li> <li>As levels of integrations increase, there will be more sharing</li> </ul> </li> </ul>                                                                                                     |   |  |
| (C) 2005 Babak Falsafi from Adve, Falsafi,<br>Hill, Lebeck, Reinhardt, Smith & Singh 18-742                                                                                                                                                                                |   |  |























































| Outline                                                                                                               |                       |  |
|-----------------------------------------------------------------------------------------------------------------------|-----------------------|--|
| Coherence Control Impleme                                                                                             | ntation               |  |
| Writebacks, Non-Atomicity,                                                                                            | & Serialization/Order |  |
| Hierarchical Cache                                                                                                    |                       |  |
| Split Buses                                                                                                           |                       |  |
| Deadlock, Livelock, & Starva                                                                                          | ition                 |  |
| Three Case Studies                                                                                                    |                       |  |
| TLB Coherence                                                                                                         |                       |  |
| • Virtual Cache Issues<br>(C) 2005 Babak Falsafi from Adve, Falsafi,<br>Hill, Lebeck, Reinhardt, Smith & Singh 18-742 | 32                    |  |













