CLUTO

Software for clustering high-dimensional datasets
Download

CLUTO Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Freeware
  • Price:
  • FREE
  • Publisher Name:
  • George Karypis
  • Publisher web site:
  • http://glaros.dtc.umn.edu/gkhome/
  • Operating Systems:
  • Mac OS X
  • File Size:
  • 19.1 MB

CLUTO Tags


CLUTO Description

Software for clustering high-dimensional datasets CLUTO is a software package for clustering low- and high-dimensional datasets and for analyzing the characteristics of the various clusters. CLUTO is well-suited for clustering data sets arising in many diverse application areas including information retrieval, customer purchasing transactions, web, GIS, science, and biology.CLUTO's distribution consists of both a library and a stand-alone programs via which an application program can access directly the various clustering and analysis algorithms implemented in CLUTO. Here are some key features of "CLUTO": · Multiple classes of clustering algorithms: partitional, agglomerative, & graph-partitioning based. · Multiple similarity/distance functions: Euclidean distance, cosine, correlation coefficient, extended Jaccard, user-defined. · Numerous novel clustering criterion functions and agglomerative merging schemes. · Traditional agglomerative merging schemes: single-link, complete-link, UPGMA · Extensive cluster visualization capabilities and output options: postscript, SVG, gif, xfig, etc. · Multiple methods for effectively summarizing the clusters: most descriptive and discriminating dimensions, cliques, and frequent itemsets. · Can scale to very large datasets containing hundreds of thousands of objects and tens of thousands of dimensions. What's New in This Release: · Eliminated the limits on the length of the line of the input files. · Fixed spelling errors in the -help for vcluster/scluster. · Eliminated the 32 bit limit on the size of the dynamically allocated memory. CLUTO can now take advantage of 64 bit address space machines. · Builds for OSX (powerpc and i386) and Linux x86_64. · Reduced memory requirements for post-clustering reordering of the cluster numbers. · Performance improvements for some hierarchical agglomerative schemes. · Experimental support for multi-core processors and SMPs using OpenMP for MS Windows and Linux-i686. (See RELEASENOTES-2.1.2 for more information) · Redesigned the dynamic memory allocation scheme to be based on Doug Lea's malloc code. · An experimental set of new API calls is being provided that gracefully cleanup all internally allocated memory in case of critical errors and returns a code to the calling program indicating the type of the problem.


CLUTO Related Software