▍1. CIPP_JSsetup
可以实现自动分词功能,支持自动标引,是处理中文自然语言的良好工具(Can achieve automatic word segmentation function, support for automatic indexing is a good tool to deal with Chinese natural language)
可以实现自动分词功能,支持自动标引,是处理中文自然语言的良好工具(Can achieve automatic word segmentation function, support for automatic indexing is a good tool to deal with Chinese natural language)
this is some code fjhg ldfh ldfgh ldfkjhg dlkjfhg dlkf gdlkf g
这是一个中文分词程序,读入一个Txt文档,可以对里面的段落进行分词(This is a Chinese word segmentation program that reads a Txt document segmentation paragraphs inside)
一种基于自动机的分词方法,可进行中文分词及统计(Based method of automatic machine word)
在Visual C~(++)中使用Unicode编程,世界上有数百种用计算机指定一个数字,来储存字母或其他字符的编码系统。(In Visual C ~(++) use Unicode programming, there are hundreds of the world, with a number assigned to the computer to store letters or other characters in the coding system.)
GBK 转 unicode 提供二分法查询(translate the gbk to the unicode,with the bianary search way)
这个一个基于逆向最大匹配的分词程序,语料规模比较小。(The maximum matching based on the reverse of the sub-term process, relatively small-scale corpus.)
中文信息逆向分词程序 是用api实现的(Chinese Information reverse segmentation process is achieved by api)
程序实现多国语言的动态切换解决方案(procedures for multi-language dynamic switching solutions)