Posts tagged: conference

KDD 2011: 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

By Volkan TUNALI, October 18, 2010 12:41 am

The annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-2011 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.

KDD-2011 will run from August 21 to 24, 2011, in San Diego, CA, and will bring together hundreds of practitioners and academic data miners in one location.

Important Dates

  • Aug 21-24, 2011 KDD-2011 Conference
  • May 13, 2011 Paper acceptance
  • Feb 18, 2011 Full Paper deadline
  • Feb 11, 2011 Paper abstract deadline

* All deadlines are at 11:59 PM Pacific time.

For more information, you can visit the conference home page.

ICDM’10: The 10th IEEE International Conference on Data Mining

By Volkan TUNALI, September 20, 2010 5:42 pm

Sponsored by the IEEE Computer Society
December 14-17, 2010, Sydney, Australia
http://datamining.it.uts.edu.au/icdm10

Important Dates
*****************
Apr 14, 2010: Deadline for workshop proposals
May 06, 2010: Deadline for ICDM contest proposals
Jul 02, 2010: Deadline for full paper submissions
Jul 13, 2010: Deadline for demo and tutorial proposals
Jul 23, 2010: Deadline for 18 Workshop paper submissions (extended to Aug 9th)
Sep 17, 2010: Notification of acceptance of full papers
Sep 20, 2010: Notification of acceptance of workshop papers
Oct 11, 2010: Camera-ready copies and copyright forms
(11:59pm Hawaii time)

The IEEE International Conference on Data Mining (ICDM) has established itself as the world’s premier research conference in data mining. The 10th edition of ICDM (ICDM ’10) provides a leading forum for presentation of original research results, as well as exchange and dissemination of innovative, practical development experiences. The conference covers all aspects of data mining, including algorithms, software and systems, and applications. In addition, ICDM draws researchers and application developers from a wide range of data mining related areas such as statistics, machine learning, pattern recognition, databases and data warehousing, data visualization, knowledge-based systems, and high performance computing. By promoting novel, high quality research findings, and innovative solutions to challenging data mining problems, the conference seeks to continuously advance the state-of-the-art in data mining. Besides the technical program, the conference will feature invited talks from Christos Faloutsos (CMU), Geoff McLachlan (UQ) and Xindong Wu (UVM), workshops, tutorials, panels, and the ICDM data mining contest.

RapidMiner Community Meeting And Conference – RCOMM 2010

By Volkan TUNALI, July 18, 2010 1:54 am

Rapid-I hosts the first RapidMiner Community Meeting And Conference (RCOMM 2010) and invites users and developers of RapidMiner to take part and share their RapidMiner experiences with other members of the community.

Important Dates

Submission Deadline: August 6, 2010
Notification of Acceptance: August 13, 2010
Camera-ready Papers: August 20, 2010
Conference: September 13 – 16, 2010

Location

University of Dortmund, Germany

More Info & Registration

You can visit the conference home page.

Cluto vs. Gmeans – An Empirical Comparison

By Volkan TUNALI, July 16, 2010 5:55 pm

Last month I attended the 1st International Symposium on Computing in Science and Engineering, held by Gediz University in Kusadasi, Turkey, where I presented a paper.

My topic was “An empirical comparison of fast and efficient tools for mining textual data”. In this paper we evaluate and compare two state-of-the-art data mining tools for clustering high-dimensional text data, Cluto and Gmeans.

The abstract of the paper is below:

In order to effectively manage and retrieve the information comprised in vast amount of text documents, powerful text mining tools and techniques are essential. In this paper we evaluate and compare two state-of-the-art data mining tools for clustering high-dimensional text data, Cluto and Gmeans. Several experiments were conducted on three benchmark datasets, and results are analysed in terms of clustering quality, memory and CPU time consumption. We empirically show that Gmeans offers high scalability by sacrificing clustering quality while Cluto presents better clustering quality at the expense of memory and CPU time.

Keywords: text mining, document clustering, spherical k-means, bisecting k-means

About Cluto

Written in ANSI C by George Karypis, CLUTO (CLUstering TOolkit) is a software package for clustering low- and high-dimensional datasets and for analyzing the characteristics of the various clusters.

Cluto contains partitional, agglomerative, and graph-partitioning-based clustering algorithms. Of the partitional algorithms, bisecting k-means is the default and is the one evaluated in the paper. Cluto also offers several distance (similarity) functions, namely cosine, Euclidean, correlation coefficient, and extended Jaccard, with cosine as the default. In addition, it lets you select one of several clustering criterion functions from four categories: internal, external, hybrid, and graph-based.
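To make the idea of bisecting k-means concrete, here is a minimal Python sketch. It is not Cluto's implementation: the function names, the rule of always splitting the largest cluster, and the fixed iteration count are assumptions made for illustration, and Cluto's criterion functions are not modelled. It only shows the general scheme of repeatedly splitting one cluster in two with a cosine-based 2-means until k clusters exist.

```python
import math
import random


def unit(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cosine(a, b):
    """Cosine similarity of two unit vectors (their dot product)."""
    return sum(x * y for x, y in zip(a, b))


def two_way_split(indices, vectors, rng, iterations=10):
    """Split the given documents into two groups with a small cosine 2-means."""
    # Hypothetical initialisation: two randomly chosen documents as centroids.
    centroids = [vectors[i] for i in rng.sample(indices, 2)]
    halves = ([], [])
    for _ in range(iterations):
        halves = ([], [])
        for i in indices:
            first = cosine(vectors[i], centroids[0])
            second = cosine(vectors[i], centroids[1])
            halves[0 if first >= second else 1].append(i)
        for side in (0, 1):
            if halves[side]:
                members = (vectors[i] for i in halves[side])
                centroids[side] = unit([sum(col) for col in zip(*members)])
    return halves


def bisecting_kmeans(docs, k, seed=0):
    """Return a list of clusters, each a list of document indices."""
    rng = random.Random(seed)
    vectors = [unit(d) for d in docs]
    clusters = [list(range(len(docs)))]
    while len(clusters) < k:
        # Assumed selection rule: always bisect the largest remaining cluster.
        clusters.sort(key=len)
        largest = clusters.pop()
        if len(largest) < 2:
            clusters.append(largest)
            break
        left, right = two_way_split(largest, vectors, rng)
        if not left or not right:  # e.g. identical documents cannot be separated
            clusters.append(largest)
            break
        clusters += [left, right]
    return clusters


# Toy term-weight vectors: two documents about one topic, two about another.
docs = [[1, 0], [0.9, 0.1], [0, 1], [0.1, 0.9]]
print(bisecting_kmeans(docs, k=2))
```

Because each step only ever solves a two-cluster problem, the outer loop stays simple; the quality of the result depends mostly on how the cluster to split is chosen and how each split is refined, which is where a real toolkit differs from this sketch.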

About Gmeans

Gmeans is a C++ program for clustering, developed by Yuqiang Guan as part of his PhD thesis. The program employs four different k-means type clustering algorithms with four different distance (similarity) measures: cosine, Euclidean, diametric distance, and Kullback-Leibler divergence. Cosine is the default similarity measure and is used for spherical k-means, in which each document vector is L2-normalised. The program also includes a local search strategy, called first variation, to help overcome the local optima problem. It generates a one-way, hard clustering of a given dataset.
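As a rough illustration of spherical k-means with L2-normalised document vectors, the following Python sketch may help. It is not taken from Gmeans: the function names, the random initialisation, and the fixed iteration count are assumptions, and the first variation local search is not included. On unit-length vectors the dot product equals the cosine similarity, so each document is assigned to the centroid with the largest dot product, and each centroid is the renormalised sum of its members.

```python
import math
import random


def l2_normalise(vec):
    """Scale a vector to unit Euclidean length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def spherical_kmeans(docs, k, iterations=20, seed=0):
    """Hard-cluster documents by cosine similarity; returns (labels, centroids)."""
    rng = random.Random(seed)
    vectors = [l2_normalise(d) for d in docs]
    # Hypothetical initialisation: k randomly chosen documents as centroids.
    centroids = rng.sample(vectors, k)
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: pick the centroid with the highest cosine similarity.
        labels = [
            max(range(k), key=lambda j: sum(x * y for x, y in zip(v, centroids[j])))
            for v in vectors
        ]
        # Update step: sum the members of each cluster and renormalise.
        for j in range(k):
            members = [v for v, label in zip(vectors, labels) if label == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = l2_normalise([sum(col) for col in zip(*members)])
    return labels, centroids


# Toy term-count vectors; only the direction matters after normalisation.
labels, centroids = spherical_kmeans([[2, 0], [9, 1], [0, 3], [1, 9]], k=2)
print(labels)
```

Normalising first means that long and short documents with the same term proportions are treated alike, which is the usual motivation for using cosine similarity on text.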

Download the Paper

If you are interested in the details of this comparison such as datasets used and experiments performed, you can freely download the paper here (.pdf inside .rar archive 281K).

KDD 2010: 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

By Volkan TUNALI, July 1, 2010 1:50 pm

KDD-2010 will run from July 25 to 28 in Washington, DC, and will bring together hundreds of practitioners and academic data miners in one location.

The annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-2010 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.

For more information you can visit the conference web site.
