-
luyfSearch2.0.tar
A Chinese word segmentation development kit that can be used in search engine development; fairly easy to use.
- Downloaded 2009-11-05 10:09:53
- Points: 1
-
Chinese-WordCut
A Chinese word segmentation program that reads a .txt document and segments the paragraphs in it into words.
- Downloaded 2012-11-18 17:44:16
- Points: 1
-
Leza
Good code for the troias project.
- Downloaded 2009-06-04 06:50:59
- Points: 1
-
GB2312ToUnicode
Converts GBK to Unicode, with lookup by binary search.
- Downloaded 2009-12-31 13:17:44
- Points: 1
-
HanLP-master
Named entity recognition (from GitHub).
- Downloaded 2018-01-31 01:47:04
- Points: 1
-
wordsegmentation
An automaton-based word segmentation method that supports Chinese word segmentation and statistics.
- Downloaded 2011-09-21 11:38:57
- Points: 1
-
raw
Description: Ten Chinese word segmentation datasets for training Chinese word segmentation models.
- Downloaded 2021-01-06 11:48:53
- Points: 1
-
bp
Description: To address the large data volume of the sample decision library, rough sets are used to discretize the extracted sample data.
- Downloaded 2015-07-04 20:49:43
- Points: 1
-
201411149222244
Download any Chinese text document; this program segments the document into words and can also count how many times each word occurs.
- Downloaded 2015-10-23 10:53:54
- Points: 1
-
共现矩阵 (Co-occurrence Matrix)
Description: Converts a multi-dimensional co-occurrence matrix into two-dimensional array form, turning high-dimensional data into two-dimensional data that is easier for data processing staff to analyze; also includes natural language processing.
- Downloaded 2020-07-02 16:56:12
- Points: 1