Difference between revisions of "Document classification"
Karl Jones (Talk | contribs) (Created page with "'''Document classification''' or '''document categorization''' is a problem in library science, information science and computer science. == Description == The t...") |
Karl Jones (Talk | contribs) (→External links) |
||
Line 39: | Line 39: | ||
* [https://en.wikipedia.org/wiki/Document_classification Document classification] @ Wikipedia | * [https://en.wikipedia.org/wiki/Document_classification Document classification] @ Wikipedia | ||
+ | |||
+ | [[Category:Computer science]] | ||
+ | [[Category:Data]] | ||
+ | [[Category:Information]] |
Latest revision as of 11:51, 21 April 2016
Document classification or document categorization is a problem in library science, information science and computer science.
Contents
Description
The task is to assign a document to one or more classes or categories.
This may be done:
- "Manually" (or "intellectually")
- {Algorithm|Algorithmically]]
The intellectual classification of documents has mostly been the province of library science, while the algorithmic classification of documents is mainly in information science and computer science.
The problems are overlapping, however, and there is therefore interdisciplinary research on document classification.
Systems of classification
The documents to be classified may be texts, images, music, etc.
Each kind of document possesses its special classification problems. When not otherwise specified, text classification is implied.
Documents may be classified according to their subjects or according to other attributes (such as document type, author, printing year etc.).
Content-based and request-based
There are two main philosophies of subject classification of documents:
- Content-based approach
- Request-based approach
See also
External links
- Document classification @ Wikipedia