Please use this identifier to cite or link to this item: http://172.22.28.37:8080/xmlui/handle/1/427
Title: A Hybrid Approach for ImprovingWeb Document Clustering Based on Concept Mining
Authors: Petkar, Rajendra Bhimrao
Keywords: Web Mining
Document Clustering
Natural Language Processing,
Concept mining
Issue Date: 2015
Publisher: Rajarambapu Institute of Technology, Rajaramnagar
Abstract: With the rapid development of internet, huge volumes of text data are also increasing. This rapidly growing data generate challenges for users like access, organize and analyze the required information expeditiously. Document clustering techniques mostly rely on the statistical analysis of a term. It can hard to identify, in situation when multiple terms have the same frequency value, but one term is more important in terms of meaning than the other. Also, process to discover more relevant information regarding user query on the web is uncontrollable. The proposed system tries to implement a concept based document clustering model that clusters the web documents based on the semantics or theme of the text data. The semantic analysis is done with the help of Semantic Role Labeler (SRL), to find the terms which contribute more to the meaning of the sentence. This system is called as a Concept Based Analysis Mechanism (CBAM). This underlying model provides robust and accurate document similarity calculation that leads to improved results in Web document clustering over traditional methods
Description: Under the Guidance of Prof. S. S. Patil
URI: http://localhost:8080/xmlui/handle/1/427
Appears in Collections:M.Tech Computer Science & Engineering

Files in This Item:
File Description SizeFormat 
A Hybrid Approach for ImprovingWeb Document Clustering Based on Concept Mining.PDF
  Restricted Access
1.63 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.