Docstoc

Literatur Literatur Unicode Home Page http www unicode org The

Document Sample
Literatur Literatur Unicode Home Page http www unicode org The Powered By Docstoc
					                                   Literatur


Unicode Home Page, http://www.unicode.org/
The CJK Dictionary Institute, http://www.cjk.org/cjk/index.htm

Ken Lunde(1998), CJKV Information Processing
Ken Lunde(1999), Perl and Multiple-Byte Characters,
    http://examples.oreilly.com/cjkvinfo/perl/svpm99-paper.pdf

Chen, Aitao(2003), Chinese Word Segmentation Using Minimal Linguistic
     Knowledge. http://metadata.sims.berkeley.edu/papers/sighan03.pdf.
Chooi-Ling Goh, Masayuki Asahara, and Yuji Matsumoto(2005)
     Chinese Word Segmentation by Classification of Characters: Erscheint
     in Computational Linguistics and Chinese Language Processing
Liu(2005), The Domain-Adaptive Chinese Word Segmentation
Peng, Fuchen, Fangfang Feng & Andrew McCallum(2004), Chinese
     Segmentation and New Word Detection using Conditional Random
     Fields. http://www.cs.umass.edu/ mccallum/papers/coling04.pdf
Sproat, Richard & Thomas Emerson(2003) The First International Chinese
     Word Segmentation Bakeoff. http://www.sighan.org/bakeoff2003/paper.pdf.

Ando, Rie Kubota & Lilian Lee(2003), Mostly- Unsupervised Statiscal
    Segmentation of Japanese Kanji Sequences. Natural Language
    Engineering 9(2): 127-149
Masayuki Asahara & Yuji Matsumoto(2004),Japanese Unknown Word
    Identification by Character-based Chunking. In: COLING-2004.
Chooi-Ling Goh, Masayuki Asahara & Yuji Matsumoto(2005), Building
    a Japanese-Chinese Dictionary Using Kanji/Hanzi Conversion. In: IJCNLP-2005

Kang,S.-S. & C.-W. Woo(2001), Automatic Segmentation of words using syllable
     bigram statistics. In Proceedings of the 6th Natural Language Processing
     Pacific Rim Symposium, p. 729-732.
Lee, Do-Gil, Hae-Chang Rim & Heui-Seok Lim(2003), A Syllable Based Word
     Recognition Model for Korean Noun Extraction. In: Proceedings of the 41st
     Annual Meeting of the Association for Computational Linguistics. P03-1060.




                                           1

				
DOCUMENT INFO
Shared By:
Categories:
Tags:
Stats:
views:18
posted:11/27/2011
language:Esperanto
pages:1