Document Type

Article

Publication Date

10-2007

Publication Source

Journal of the American Society for Information Science and Technology (JASIST)

Abstract

We present an approach to enhancing information access through Web structure mining in contrast to traditional approaches involving usage mining. Specifically, we mine the hardwired hierarchical hyperlink structure of Web sites to identify patterns of term-term co-occurrences we call Web functional dependencies (FDs). Intuitively, a Web FD ‘x y’ declares that all paths through a site involving a hyperlink labeled x also contain a hyperlink labeled y. The complete set of FDs satisfied by a site help characterize (flexible and expressive) interaction paradigms supported by a site, where a paradigm is the set of explorable sequences therein. We describe algorithms for mining FDs and results from mining several hierarchical Web sites and present several interface designs that can exploit such FDs to provide compelling user experiences.

Inclusive pages

1805-1819

ISBN/ISSN

1532-2882

Document Version

Postprint

Comments

Author's accepted version included in repository in compliance with publisher policies on self-archiving; permission documentation is on file. The version of record is available online from the publisher.

Publisher

John Wiley & Sons Inc.

Volume

58

Issue

12

Peer Reviewed

yes