A Fuzzy Approach for Feature Evaluation and Dimensionality Reduction to Improve the Quality of Web Usage Mining Results
Zahid Ansari, M.F.Azeem, A. Vinaya Babu, Waseem Ahmed

TL;DR
This paper introduces a fuzzy set theoretic method for feature evaluation and dimensionality reduction in web usage mining, improving clustering quality by assigning weights rather than removing data.
Contribution
It proposes a novel fuzzy approach for feature selection and session weighting, enhancing clustering performance over traditional elimination methods.
Findings
Fuzzy weighting improves cluster validity indices.
The approach retains more information than direct elimination.
Results show better clustering performance.
Abstract
Web Usage Mining is the application of data mining techniques to web usage log repositories in order to discover the usage patterns that can be used to analyze the users navigational behavior. During the preprocessing stage, raw web log data is transformed into a set of user profiles. Each user profile captures a set of URLs representing a user session. Clustering can be applied to this sessionized data in order to capture similar interests and trends among users navigational patterns. Since the sessionized data may contain thousands of user sessions and each user session may consist of hundreds of URL accesses, dimensionality reduction is achieved by eliminating the low support URLs. Very small sessions are also removed in order to filter out the noise from the data. But direct elimination of low support URLs and small sized sessions may results in loss of a significant amount of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
