Conference Proceeding

A table-based method for single-pass cache optimization.

01/2008; pp.71-76 In proceeding of: Proceedings of the 18th ACM Great Lakes Symposium on VLSI 2008, Orlando, Florida, USA, May 4-6, 2008
Source: DBLP
0 0
 · 
0 Bookmarks
 · 
64 Views
  • Conference Proceeding: A performance evaluation of memory hierarchy in embedded systems
    [show abstract] [hide abstract]
    ABSTRACT: The increasing speed gap between processors and memory makes the design of memory hierarchy one of the critical issues in general purpose embedded systems. As memory requirements for embedded applications grow, especially in emerging area of handheld multimedia devices, cache memories become crucial for providing high performance and reducing power. This paper describes a performance evaluation of typical cache design issues such as cache size and organization, block size, and replacement policy. The evaluation is done using simulation tools for architectural exploration based on ARM instruction set and MiBench benchmark suite. Our performance evaluation includes monitoring of dynamic cache behavior, since embedded systems designers are interested not only in the total number of cache misses, but also in the number of cache misses throughout application execution.
    System Theory, 2003. Proceedings of the 35th Southeastern Symposium on; 04/2003
  • Source
    Article: Selective Cache Ways: On-Demand Cache Resource Allocation
    [show abstract] [hide abstract]
    ABSTRACT: Increasing levels of microprocessor power dissipation call for new approaches at the architectural level that save energy by better matching of on-chip resources to application requirements. Selective cache ways provides the ability to disable a subset of the ways in a set associative cache during periods of modest cache activity, while the full cache may remain operational for more cache-intensive periods. Because this approach leverages the subarray partitioning that is already present for performance reasons, only minor changes to a conventional cache are required, and therefore, full-speed cache operation can be maintained. Furthermore, the tradeoff between performance and energy is flexible, and can be dynamically tailored to meet changing application and machine environmental conditions. We show that trading off a small performance degradation for energy savings can produce a significant reduction in cache energy dissipation using this approach. 1. Introduction Contin...
    06/2000;
  • Source
    Conference Proceeding: StatCache: a probabilistic approach to efficient and accurate data locality analysis
    [show abstract] [hide abstract]
    ABSTRACT: The widening memory gap reduces performance of applications with poor data locality. Therefore, there is a need for methods to analyze data locality and help application optimization. In this paper we present StatCache, a novel sampling-based method for performing data-locality analysis on realistic workloads. StatCache is based on a probabilistic model of the cache, rather than a functional cache simulator. It uses statistics from a single run to accurately estimate miss ratios of fully-associative caches of arbitrary sizes and generate working-set graphs. We evaluate StatCache using the SPEC CPU2000 benchmarks and show that StatCache gives accurate results with a sampling rate as low as 10<sup>-4</sup>. We also provide a proof-of-concept implementation, and discuss potentially very fast implementation alternatives.
    Performance Analysis of Systems and Software, 2004 IEEE International Symposium on - ISPASS; 02/2004

Full-text

View
3 Downloads
Available from