TY - GEN
T1 - Toward behavioral modeling of a grid system
T2 - 17th IEEE International Conference on Data Mining Workshops, ICDM Workshops 2007
AU - Zhang, Xiangliang
AU - Sebag, Michèle
AU - Germain, Cécile
PY - 2007
Y1 - 2007
N2 - Grid systems are complex heterogeneous systems, and their modeling constitutes a highly challenging goal. This paper is interested in modeling the jobs handled by the EGEE grid, by mining the Logging and Bookkeeping files. The goal is to discover meaningful job clusters, going beyond the coarse categories of "successfully terminated jobs" and "other jobs". The presented approach is a three-step process: i) Data slicing is used to alleviate the job heterogeneity and afford discriminant learning; ii) Constructive induction proceeds by learning discriminant hypotheses from each data slice; iii) Finally, double clustering is used on the representation built by constructive induction; the clusters are fully validated after the stability criteria proposed by Meila (2006). Lastly, the job clusters are submitted to the experts and some meaningful interpretations are found.
AB - Grid systems are complex heterogeneous systems, and their modeling constitutes a highly challenging goal. This paper is interested in modeling the jobs handled by the EGEE grid, by mining the Logging and Bookkeeping files. The goal is to discover meaningful job clusters, going beyond the coarse categories of "successfully terminated jobs" and "other jobs". The presented approach is a three-step process: i) Data slicing is used to alleviate the job heterogeneity and afford discriminant learning; ii) Constructive induction proceeds by learning discriminant hypotheses from each data slice; iii) Finally, double clustering is used on the representation built by constructive induction; the clusters are fully validated after the stability criteria proposed by Meila (2006). Lastly, the job clusters are submitted to the experts and some meaningful interpretations are found.
UR - http://www.scopus.com/inward/record.url?scp=49649115109&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2007.52
DO - 10.1109/ICDMW.2007.52
M3 - Conference contribution
AN - SCOPUS:49649115109
SN - 0769530192
SN - 9780769530192
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 581
EP - 586
BT - ICDM Workshops 2007 - Proceedings of the 17th IEEE International Conference on Data Mining Workshops
Y2 - 28 October 2007 through 31 October 2007
ER -