Bonner Online-Bibliographie zur Comicforschung
Unser-Schutz, Giancarla: "Developing a text-based corpus of the language of Japanese comics (manga)." In: Corpus-based Studies in Language Use Language Learning and Language Documentation. Hrsg. v. John Newman, Harald Baayen und Sally Rice. (Language and Computers, 73.) Amsterdam [etc.]: Rodopi, 2011, S. 213–238.
|Resource type: Book Article
BibTeX citation key: UnserSchutz2011a
View all bibliographic details
Keywords: Digitalisierung, Japan, Manga, Sprache
Creators: Baayen, Newman, Rice, Unser-Schutz
Publisher: Rodopi (Amsterdam [etc.])
Collection: Corpus-based Studies in Language Use Language Learning and Language Documentation
While demands for corpora from media which mix visual and linguistic elements have increased in recent years with developments in corpus-based linguistics research, the actual creation and design of such corpora present many unique problems. Most centrally, there remains much to be considered in terms of how to isolate and meaningfully represent their linguistic data. In line with these trends, in this paper I introduce a 687,654 character (55,415 entries) corpus of the language from Japanese comics (manga). Many of the issues encountered in its design are found with other media – newspaper stories, advertisements, political cartoons – which mix the visual with the linguistic. In addition to describing how such unusual text could be of interest to other researchers, the approaches taken here may help others with similar projects.
PHP execution time: 0.01293 s
SQL execution time: 0.09528 s
TPL rendering time: 0.00163 s
Total elapsed time: 0.10984 s
Peak memory usage: 1.3198 MB
Memory at close: 1.2695 MB