Measuring the homogeneity and similarity of language corpora

Corpus-based methods are now dominant in Natural Language Processing (NLP). Creating big corpora is no longer difficult and the technology to analyze them is growing faster, more robust and more accurate. However, when an NLP application performs well on one corpus, it is unclear whether this level...

Full description

Saved in:
Bibliographic details
Main Author: Cavaglia, Gabriela Maria Chiara
Format: Dissertation
Language: English
Place of publication: University of Brighton
Online Access: available in Bonn?
Database: Networked Digital Library of Theses and Dissertations
Database information Databases - DBIS