ADMiRe: An Algebraic Data Mining Approach to System Performance Analysis

Document Type

Article

Publication Date

7-1-2005

Keywords

Algorithms for data and knowledge management, Data mining, Performance of systems

Abstract

Performance analysis of computing systems is an increasingly difficult task due to growing system complexity. Traditional tools rely on ad hoc procedures. With these, determining which of the manifold system and workload parameters to examine is often a lengthy and highly speculative process. The analysis is often incomplete and, therefore, prone to yielding faulty conclusions and missing useful tuning knowledge. We address this problem by introducing a data mining approach called ADMiRe (Analyzer for Data Mining Results). In this scheme, regression analysis is first applied to performance data to discover correlations between various system and workload parameters. The results of this analysis are summarized in sets of regression rules. The user can then formulate intuitive algebraic expressions to manipulate these sets of rules to capture critical information. To demonstrate this approach, we use ADMiRe to analyze an Oracle database system running the TPC-C (Transaction Processing Performance Council) benchmark. The results generated by ADMiRe were confirmed by Oracle experts. We also show that by applying ADMiRe to Microsoft Internet Information Server performance data, we can improve system performance by 20 percent. © 2005 IEEE.
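
The pipeline outlined in the abstract (regression over performance data, rules collected into sets, algebraic expressions over those sets) can be illustrated with a minimal sketch. This is not the authors' implementation: the rule format, the R² threshold, the function names (`fit_rule`, `mine_rules`), and the sample measurements are all hypothetical, and plain Python set operators stand in for whatever algebra ADMiRe actually defines.

```python
from itertools import combinations
from statistics import mean


def fit_rule(xs, ys):
    """Least-squares line ys ~ slope * xs + intercept.

    Returns (slope, intercept, r2), or None if either series is constant.
    """
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        return None
    slope = sxy / sxx
    r2 = (sxy * sxy) / (sxx * syy)  # squared Pearson correlation
    return slope, my - slope * mx, r2


def mine_rules(samples, min_r2=0.9):
    """Return the set of parameter-name pairs whose linear fit has r2 >= min_r2.

    `min_r2` is an arbitrary illustrative threshold, not a value from the paper.
    """
    rules = set()
    for a, b in combinations(sorted(samples), 2):
        fit = fit_rule(samples[a], samples[b])
        if fit is not None and fit[2] >= min_r2:
            rules.add((a, b))
    return rules


# Hypothetical measurements taken under two workloads.
light_load = {
    "users": [1, 2, 3, 4, 5],
    "cpu_util": [10, 20, 30, 40, 50],
    "disk_waits": [5, 1, 4, 2, 3],
}
heavy_load = {
    "users": [10, 20, 30, 40, 50],
    "cpu_util": [18, 36, 54, 72, 90],
    "disk_waits": [2, 4, 6, 8, 10],
}

light_rules = mine_rules(light_load)
heavy_rules = mine_rules(heavy_load)

# Algebraic expressions over rule sets, here ordinary set operations.
common = light_rules & heavy_rules      # correlations that hold in both workloads
heavy_only = heavy_rules - light_rules  # correlations that appear only under heavy load
```

In this toy data, the set difference isolates the correlations involving `disk_waits` that emerge only under heavy load, which is the kind of tuning-relevant contrast the abstract says the algebraic expressions are meant to capture.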

Journal Title

IEEE Transactions on Knowledge and Data Engineering

Volume

17

Issue

7

First Page

888

Last Page

901

DOI

https://doi.org/10.1109/TKDE.2005.103

First Department

Computing
