A Hybrid Approach for ImprovingWeb Document Clustering Based on Concept Mining

Petkar, Rajendra Bhimrao

Please use this identifier to cite or link to this item: http://172.22.28.37:8080/xmlui/handle/1/427

Title:	A Hybrid Approach for ImprovingWeb Document Clustering Based on Concept Mining
Authors:	Petkar, Rajendra Bhimrao
Keywords:	Web Mining Document Clustering Natural Language Processing, Concept mining
Issue Date:	2015
Publisher:	Rajarambapu Institute of Technology, Rajaramnagar
Abstract:	With the rapid development of internet, huge volumes of text data are also increasing. This rapidly growing data generate challenges for users like access, organize and analyze the required information expeditiously. Document clustering techniques mostly rely on the statistical analysis of a term. It can hard to identify, in situation when multiple terms have the same frequency value, but one term is more important in terms of meaning than the other. Also, process to discover more relevant information regarding user query on the web is uncontrollable. The proposed system tries to implement a concept based document clustering model that clusters the web documents based on the semantics or theme of the text data. The semantic analysis is done with the help of Semantic Role Labeler (SRL), to find the terms which contribute more to the meaning of the sentence. This system is called as a Concept Based Analysis Mechanism (CBAM). This underlying model provides robust and accurate document similarity calculation that leads to improved results in Web document clustering over traditional methods
Description:	Under the Guidance of Prof. S. S. Patil
URI:	http://localhost:8080/xmlui/handle/1/427
Appears in Collections:	M.Tech Computer Science & Engineering

Files in This Item:

File	Description	Size	Format
A Hybrid Approach for ImprovingWeb Document Clustering Based on Concept Mining.PDF Restricted Access		1.63 MB	Adobe PDF	View/Open Request a copy

Show full item record