References
Unicode Home Page, http://www.unicode.org/
The CJK Dictionary Institute, http://www.cjk.org/cjk/index.htm
Ken Lunde(1998), CJKV Information Processing
Ken Lunde(1999), Perl and Multiple-Byte Characters,
http://examples.oreilly.com/cjkvinfo/perl/svpm99-paper.pdf
Chen, Aitao(2003), Chinese Word Segmentation Using Minimal Linguistic
Knowledge. http://metadata.sims.berkeley.edu/papers/sighan03.pdf.
Chooi-Ling Goh, Masayuki Asahara & Yuji Matsumoto(2005),
Chinese Word Segmentation by Classification of Characters. To appear
in: Computational Linguistics and Chinese Language Processing
Liu(2005), The Domain-Adaptive Chinese Word Segmentation
Peng, Fuchen, Fangfang Feng & Andrew McCallum(2004), Chinese
Segmentation and New Word Detection using Conditional Random
Fields. http://www.cs.umass.edu/~mccallum/papers/coling04.pdf
Sproat, Richard & Thomas Emerson(2003), The First International Chinese
Word Segmentation Bakeoff. http://www.sighan.org/bakeoff2003/paper.pdf.
Ando, Rie Kubota & Lillian Lee(2003), Mostly-Unsupervised Statistical
Segmentation of Japanese Kanji Sequences. Natural Language
Engineering 9(2): 127-149
Masayuki Asahara & Yuji Matsumoto(2004), Japanese Unknown Word
Identification by Character-based Chunking. In: COLING-2004.
Chooi-Ling Goh, Masayuki Asahara & Yuji Matsumoto(2005), Building
a Japanese-Chinese Dictionary Using Kanji/Hanzi Conversion. In: IJCNLP-2005
Kang, S.-S. & C.-W. Woo(2001), Automatic Segmentation of Words Using Syllable
Bigram Statistics. In: Proceedings of the 6th Natural Language Processing
Pacific Rim Symposium, p. 729-732.
Lee, Do-Gil, Hae-Chang Rim & Heui-Seok Lim(2003), A Syllable Based Word
Recognition Model for Korean Noun Extraction. In: Proceedings of the 41st
Annual Meeting of the Association for Computational Linguistics. P03-1060.