The way a corpus is compiled greatly influences the results obtained in any research study. The Internet is currently the principal source of texts for corpus creation, but because anybody can publish on the web without any kind of review, researchers must always ensure that texts come from reliable sites and must continually assess the quality of textual resources. This paper describes a set of evaluation parameters used by the authors to assess website validity. Our evaluation protocol is composed of three parameters, namely authority, content and design, each of which is divided into a set of sub-parameters. By applying this evaluation protocol to website texts, corpus quality may be assured. In addition, the protocol may be extended to assure the quality of corpora in other domains.