HOMEE-SUBMISSIONSITEMAPCONTACT US

CORPUS LINGUSITICS RESEARCH

pISSN: 2465-812X

Journal SearchALL ISSUE

ALL ISSUE

Export Citation Download PDF PMC Previewer
Using Multi-Dimensional Analysis to Study Register Variation on the Searchable Web ×
  • EndNote
  • RefWorks
  • Scholar's Aid
  • BibTeX

Export Citation Cancel

CORPUS LINGUSITICS RESEARCH Vol.2 No. pp.1-23
Using Multi-Dimensional Analysis to Study Register Variation on the Searchable Web
Douglas Biber
Northern Arizona University
Jesse Egbert
Northern Arizona University
Key Words : register variation,web registers,multi-dimensional analysis,discourse domain

Abstract

Most previous linguistic studies of web language have focused on the ‘new' internet registers, like blogs, instant messages, and tweets. As a result, we know surprisingly little about the patterns of linguistic variation among the full range of registers found on the searchable web. The present paper provides an overview of a project that begins to fill this gap. Rather than collecting texts from only the ‘new' web registers, the project is based on a large corpus representing a random sample of the entire searchable web. The first analytical step in the project was to analyze the types of documents found in that corpus, providing an empirical description of the composition of the searchable web. Then, Multi-Dimensional (MD) analysis was applied to describe the patterns of register variation found on the searchable web. The MD analysis first identified the sets of co-occurring linguistic features -- the ‘dimensions' -- in this discourse domain. Then, those dimensions are used to document the similarities and differences among web registers. In conclusion, we compare our results here to previous MD studies, identifying patterns peculiar to the web versus linguistic patterns found across discourse domains.
LIST
Export citation