Notice: Undefined index: linkPowrot in C:\wwwroot\wwwroot\publikacje\publikacje.php on line 1275
Publikacje
Pomoc (F2)
[21470] Artykuł:

Generalized SOMs with Splitting-Merging Tree-Like Structures for WWW-Document Clustering

Czasopismo: Advances in Intelligent Systems Research   Tom: 89, Strony: 186-193
ISSN:  1951-6851
ISBN:  978-94-62520-77-6
Wydawca:  ATLANTIS PRESS, 29 AVENUE LAVMIERE, PARIS, 75019, FRANCE
Opublikowano: 2015
Seria wydawnicza:  Advances in Intelligent Systems Research
 
  Autorzy / Redaktorzy / Twórcy
Imię i nazwisko Wydział Katedra Procent
udziału
Liczba
punktów
Marian Bolesław Gorzałczany orcid logoWEAiIKatedra Informatyki, Elektroniki i Elektrotechniki *337.50  
Filip Rudziński orcid logoWEAiIKatedra Informatyki, Elektroniki i Elektrotechniki *337.50  
Jakub Piekoszewski orcid logoPolitechnika Świętokrzyska w Kielcach; Wydział Elektrotechniki, Automatyki i Informatyki33.00  

Grupa MNiSW:  Materiały z konferencji międzynarodowej (zarejestrowane w Web of Science)
Punkty MNiSW: 15
Klasyfikacja Web of Science: Proceedings Paper


Pełny tekstPełny tekst     DOI LogoDOI     Web of Science Logo Web of Science    
Keywords:

WWW-document clustering  generalized SOMs with tree-like structures  cluster analysis  unsupervised learning  



Abstract:

This paper presents our clustering technique based on generalized SOMs with evolving splittingmerging tree-like structures and its application to complex clustering problems including some benchmark data sets and, first of all, WWW-document clustering. Our approach that works in a fully unsupervised way (i.e., without the pre-defined cluster number and using unlabelled data), automatically detects the number of clusters and generates multiprototypes for them. The collection of 548 abstracts of technical reports as well as its 476-element subset, both available at WWW server of the Department of Computer Science, University of Rochester, USA (www.cs.rochester.edu/trs) are the subjects of clustering. A comparative analysis with five alternative clustering techniques is also carried out. The reported results prove that our approach is a powerful tool (that outperforms several alternative approaches) for complex cluster-analysis tasks including the problems of WWW-document clustering.