Hurtado, C., & Mendoza, M. (2011). Automatic Maintenance Of Web Directories By Mining Web Browsing Data. J. Web Eng., 10(2), 153–173.
Abstract: Web directories allow Web users to browse a hierarchy of categories, under which different types of resources are classified. We study the problem of maintaining a Web directory, that is, the problem of continually discovering and ranking resources that are relevant to the categories of the directory. We propose an unsupervised computational method that maintains the directory by analyzing user browsing data. The method is based on the extraction and classification of user sessions (sequences of resources selected by users) into the categories of the directory. In addition, we show that the directory maintenance method can be slightly modified to find queries that are useful for finding relevant resources, allowing users to switch from directory browsing to query formulation. Experimental results indicate that the proposed methods are effective: they identify new pages in each category and recommend related queries with high precision, without needing labeled data to conduct traditional web page and query classification tasks.
|
Vargas-Vera, M., Nagy, M., & De Pablos, P. O. (2013). A Framework For Detecting And Removing Knowledge Overlaps In A Collaborative Environment: Case Of Study A Computer Configuration Problem. J. Web Eng., 12(5), 422–438.
Abstract: This paper presents a framework for knowledge integration based on mappings between similar concepts in constraint graphs associated with a configuration problem. In particular, the paper deals with one of the problems that can arise when performing collaborative knowledge integration, namely detecting knowledge overlaps. The solution to the overlapping problem relies on the use of matching algorithms embedded in DSSim (short for Dempster-Shafer Similarity). To illustrate the approach, a case study of a computer configuration problem is presented. The solution to the knowledge overlap problem is important because it promises to become an alternative to current knowledge integration solutions. Through our approach the real cost of integration can be reduced, as it is not necessary to invest a great amount of resources before a truly integrated system can be operational.
|