Bonner Online-Bibliographie zur Comicforschung
Unser-Schutz, Giancarla: "Developing a text-based corpus of the language of Japanese comics (manga)." In: John Newman, Harald Baayen and Sally Rice (eds.): Corpus-based Studies in Language Use, Language Learning, and Language Documentation. (Language and Computers, 73.) Amsterdam [etc.]: Rodopi, 2011, pp. 213–238.
Added by: joachim (23 Feb 2017 12:09:59 UTC) Last edited by: joachim (23 Feb 2017 12:16:22 UTC)
Resource type: Book Article
BibTeX citation key: UnserSchutz2011a
Keywords: Digitization, Japan, Manga, Language
Creators: Baayen, Newman, Rice, Unser-Schutz
Publisher: Rodopi (Amsterdam [etc.])
Collection: Corpus-based Studies in Language Use, Language Learning, and Language Documentation
While demands for corpora from media which mix visual and linguistic elements have increased in recent years with developments in corpus-based linguistics research, the actual creation and design of such corpora present many unique problems. Most centrally, there remains much to be considered in terms of how to isolate and meaningfully represent their linguistic data. In line with these trends, in this paper I introduce a 687,654-character (55,415 entries) corpus of the language from Japanese comics (manga). Many of the issues encountered in its design are found with other media (newspaper stories, advertisements, political cartoons) which mix the visual with the linguistic. This paper describes how such unusual text could be of interest to other researchers, and the approaches taken here may help others with similar projects.