Bereichsnavigation

Publications+
- Publications in 2021
- Publications in 2020
- Publications in 2019
- Publications in 2018
- Publications in 2017
- Publications in 2016
- Publications in 2015
- Publications in 2014
- Publications in 2013
- Publications in 2012
  - Efficient Frequent Item Counting in Multi-Core Hardware
  - - KDD 2012 Reviews
    - (P)VLDB 2012 Reviews
  - Sorting Networks on FPGAs
  - Skeleton Automata for FPGAs: Reconfiguring without Reconstructing
  - MXQuery With Hardware Acceleration
- Publications in 2011
- Publications in 2010
- Publications in 2009
- Publications in 2008
- Publications in 2007
- Publications in 2006
- Publications in 2005
- Publications in 2004
- Publications in 2003
- Publications in 2001
- Bachelor, Master, and Diploma Theses

Hauptinhalt

Efficient Frequent Item Counting in Multi-Core Hardware

Publication Details

Title

Efficient Frequent Item Counting in Multi-Core Hardware

Authors

Pratanu Roy, Jens Teubner, and Gustavo Alonso

Published

Proceedings of the 2012 ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2012), Beijing, China, August 2012.

Download

paper (PDF)

Abstract

The increasing number of cores and the rich instruction sets of modern hardware are opening up new opportunities for optimizing many traditional data mining tasks. In this paper we demonstrate how to speed up the performance of the computation of frequent items by almost one order of magnitude over the best published results by matching the algorithm to the underlying hardware architecture.

We start with the observation that frequent item counting, like other data mining tasks, assumes certain amount of skew in the data. We exploit this skew to design a new algorithm that uses a pre-filtering stage that can be implemented in a highly efficient manner through SIMD instructions. Using pipelining, we then combine this pre-filtering stage with a conventional frequent item algorithm (Space-Saving) that will process the remainder of the data. The resulting operator can be parallelized with a small number of cores, leading to a parallel implementation that does not suffer any of the overheads of existing parallel solutions when querying the results and offers significantly higher throughput.

Publication Log

May 2012

camera-ready for KDD 2012

camera-ready paper (PDF)

February 2012

submission to KDD 2012 (accepted)

submission (PDF)
reviews (results: “I will argue to accept”, “Leave for senior PC to decide”, “Leave for senior PC to decide”)

December 2011

submission to (P)VLDB 2012 (rejected)

submission (PDF)
reviews (results: reject, reject, reject)

Nebeninhalt

Kontakt

Prof. Dr. Jens Teubner
Tel.: 0231 755-6481

Sprungmarken

Servicenavigation

Hauptnavigation

Bereichsnavigation

Hauptinhalt

Efficient Frequent Item Counting in Multi-Core Hardware

Publication Details

Title

Authors

Published

Download

Abstract

Publication Log

May 2012

February 2012

December 2011

Nebeninhalt

Kontakt