Measuring the homogeneity and similarity of language corpora

Corpus-based methods are now dominant in Natural Language Processing (NLP). Creating large corpora is no longer difficult, and the technology for analyzing them is becoming faster, more robust and more accurate. However, when an NLP application performs well on one corpus, it is unclear whether this level...

Bibliographic details
Main Author: Cavaglia, Gabriela Maria Chiara
Format: Dissertation
Language: English
Date of publication: 01.07.2005
Database: University of Brighton Research Repository