Journal of Jianghan University(Natural Science Edition) ›› 2013, Vol. 41 ›› Issue (2): 47-52.
Previous Articles Next Articles
CHENG Yuan-bin
Online:
Published:
Abstract: Language is the main tool of thinking. Words are the basic unit of language. However,character encoding is the present encoding method in computer information processing. With in-depth development of computer information processing,the disadvantages of character encoding increasingly appear. From the basic needs of information processing and the basic characteristics of the words,an unified encoding scheme on comprehensive consideration of word-character,and word-oriented is proposed. The scheme based on the existing coding standard UTF-16,maintains the existing character encoding,adds words coding;words encoding are logical organized with the concept space tree including some semantic information and semantic relationship,adapting to cluster retrieval and language code convert between two languages are the principles of spatial organization. At last,points out several problems which need further study.
Key words: words encoding, UTF-16, cluster retrieval, concept space tree, natural language processing
CLC Number:
TP391.11
CHENG Yuan-bin. Encoding Scheme Based on Words[J]. Journal of Jianghan University(Natural Science Edition), 2013, 41(2): 47-52.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: https://qks.jhun.edu.cn/jhdx_zk/EN/
https://qks.jhun.edu.cn/jhdx_zk/EN/Y2013/V41/I2/47