DSpace
 

DSpace at IIT Bombay >
IITB Publications >
Proceedings papers >

Please use this identifier to cite or link to this item: http://dspace.library.iitb.ac.in/jspui/handle/100/1933

Title: Document classification through interactive supervision of document and term labels
Authors: GODBOLE, S
HARPALE, A
SARAWAGI, S
CHAKRABARTI, S
Issue Date: 2004
Publisher: SPRINGER-VERLAG BERLIN
Citation: KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS,3202,185-196
Abstract: Effective incorporation of human expertise, while exerting a low cognitive load, is a critical aspect of real-life text classification applications that is not adequately addressed by batch-supervised high-accuracy learners. Standard text classifiers are supervised in only one way: assigning labels to whole documents. They are thus deprived of the enormous wisdom that humans carry about the significance of words and phrases in context. We present HIClass, an interactive and exploratory labeling package that actively collects user opinion on feature representations and choices, as well as whole-document labels, while minimizing redundancy in the input sought. Preliminary experience suggests that, starting with essentially an unlabeled corpus, very little cognitive labor suffices to set up a labeled collection on which standard classifiers perform well.
URI: http://dspace.library.iitb.ac.in/xmlui/handle/10054/15170
http://hdl.handle.net/100/1933
ISBN: 3-540-23108-0
ISSN: 0302-9743
Appears in Collections:Proceedings papers

Files in This Item:

There are no files associated with this item.

View Statistics

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback