Journal of English Linguistics

The American National Corpus

Overall Goals and the First Release

Randi Reppen

Northern Arizona University

Nancy Ide

Vassar College

The American National Corpus (ANC) will be a carefully designed corpus of 100 million words of American written and spoken language that generally follows the framework of the British National Corpus. The ANC project will provide both a standard format for text encoding and a format for different types of corpus annotation (e.g., parts of speech, rhetorical features), as well as different versions of the same type of annotation (e.g., multiple part-of-speech taggings). As the only widely available large corpus of spoken and written American English, the ANC will represent a synchronic slice of American English across many registers. The First Release of the ANC, described in this article, is a preview of the corpus and a chance for researchers to contribute feedback on format and related issues, while giving them access to data before the entire corpus is completed.

Key Words: American National Corpus • corpus linguistics • computational linguistics • encoding • annotation

Journal of English Linguistics, Vol. 32, No. 2, 105-113 (2004)
DOI: 10.1177/0075424204264856


This article has been cited by other articles:

I. WonHo Yoo
A Corpus Analysis of (The) Last/Next + Temporal Nouns
Journal of English Linguistics, March 1, 2008; 36(1): 39-61.

J. Shirato and P. Stapleton
Comparing English vocabulary in a spoken learner corpus with a native speaker corpus: Pedagogical implications arising from an empirical study in Japan
Language Teaching Research, October 1, 2007; 11(4): 393-412.

L. D. Antieau
Book Review: Language in the U.S.A.: Themes for the Twenty-First Century
Journal of English Linguistics, December 1, 2005; 33(4): 370-374.

C. F. Meyer
ADS Annual Lecture: Can You Really Study Language Variation in Linguistic Corpora?
American Speech, December 1, 2004; 79(4): 339-355.