class
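Code description:
Chinese text classification for text that has already been word-segmented. You import your own data, then train and predict with the SVM from libsvm. Features are weighted with the TF-IDF algorithm, and a chi-square test is used for feature selection with a user-configurable threshold (text mining).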
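The pipeline described above (chi-square feature selection with a threshold, TF-IDF weighting, then SVM training with libsvm) can be sketched roughly as follows. This is a minimal illustration and not the code inside Classifier.jar: the function names, the toy documents, the threshold value, and the specific TF-IDF variant (term frequency times log(N/df)) are all assumptions made for the example. The only external convention it relies on is libsvm's sparse input format, `label index:value ...`, with indices starting at 1.

```python
import math
from collections import Counter


def chi_square_scores(docs, labels):
    """Score each term by its largest chi-square statistic over all classes.

    docs   : list of token lists (text is assumed to be segmented already)
    labels : list of class labels, one per document
    """
    n = len(docs)
    class_counts = Counter(labels)
    # doc_freq[term][label] = number of documents of that class containing term
    doc_freq = {}
    for tokens, label in zip(docs, labels):
        for term in set(tokens):
            doc_freq.setdefault(term, Counter())[label] += 1

    scores = {}
    for term, per_class in doc_freq.items():
        total_with_term = sum(per_class.values())
        best = 0.0
        for label, n_class in class_counts.items():
            a = per_class.get(label, 0)   # in class, contains term
            b = total_with_term - a       # other classes, contains term
            c = n_class - a               # in class, lacks term
            d = n - n_class - b           # other classes, lacks term
            denom = (a + c) * (b + d) * (a + b) * (c + d)
            if denom:
                best = max(best, n * (a * d - b * c) ** 2 / denom)
        scores[term] = best
    return scores


def select_features(scores, threshold):
    """Keep the terms whose chi-square score reaches the threshold."""
    return sorted(t for t, s in scores.items() if s >= threshold)


def tfidf_vectors(docs, vocab):
    """Build sparse TF-IDF vectors {feature_index: weight} over vocab."""
    n = len(docs)
    index = {t: i + 1 for i, t in enumerate(vocab)}  # libsvm indices start at 1
    df = Counter(t for tokens in docs for t in set(tokens) if t in index)
    vectors = []
    for tokens in docs:
        tf = Counter(t for t in tokens if t in index)
        length = len(tokens) or 1
        vectors.append(
            {index[t]: (cnt / length) * math.log(n / df[t]) for t, cnt in tf.items()}
        )
    return vectors


def to_libsvm_line(label, vec):
    """Format one sample in libsvm's sparse text format."""
    feats = " ".join(f"{i}:{w:.6f}" for i, w in sorted(vec.items()) if w != 0)
    return f"{label} {feats}".rstrip()


# Toy, made-up data: two "up" (1) and two "down" (-1) documents.
docs = [["股价", "上涨", "利好"], ["股价", "下跌", "亏损"],
        ["上涨", "反弹"], ["下跌", "暴跌"]]
labels = [1, -1, 1, -1]

scores = chi_square_scores(docs, labels)
vocab = select_features(scores, threshold=2.0)  # threshold chosen arbitrarily
for label, vec in zip(labels, tfidf_vectors(docs, vocab)):
    print(to_libsvm_line(label, vec))
```

The printed lines could be saved to a file and passed to libsvm's training tool. Raising the threshold keeps fewer, more class-specific terms; lowering it keeps more of the vocabulary.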
File list:
class
.....\ans7.txt,155467,2014-07-11
.....\cachePredictProblem.txt,22457,2014-07-11
.....\cacheProblemFolder.txt,190586,2014-07-11
.....\Classifier.jar,3260397,2011-03-30
.....\configures
.....\..........\classificationLog4j.properties,2394,2011-03-30
.....\..........\features,0,2011-03-30
.....\..........\figureMappingToType,120,2014-07-11
.....\..........\KindMapping,181,2014-07-11
.....\data
.....\....\BigramDict.dct,7544244,2011-03-19
.....\....\coreDict.dct,1565689,2011-03-19
.....\....\lexical.ctx,10412,2011-03-19
.....\....\nr.ctx,1032,2011-03-19
.....\....\nr.dct,113780,2011-03-19
.....\....\ns.ctx,408,2011-03-19
.....\....\ns.dct,54278,2011-03-19
.....\....\tr.ctx,408,2011-03-19
.....\....\tr.dct,64000,2011-03-19
.....\example.log,159618,2014-07-11
.....\ICTCLAS.dll,155648,2011-03-19
.....\map,57849,2014-07-11
.....\model,399883,2014-07-11
.....\ProblemScale.txt,10714,2014-07-11
.....\readme.txt,524,2011-04-02
.....\result
.....\......\Down
.....\......\....\TEST-DOWN1.txt,1259,2014-07-11
.....\......\....\TEST-DOWN3.txt,601,2014-07-11
.....\......\....\TEST-DOWN5.txt,575,2014-07-11
.....\......\....\TEST-UP1.txt,1302,2014-07-11
.....\......\....\TEST-UP3.txt,677,2014-07-11
.....\......\Up
.....\......\..\TEST-DOWN2.txt,636,2014-07-11
.....\......\..\TEST-DOWN4.txt,524,2014-07-11
.....\......\..\TEST-UP2.txt,206,2014-07-11
.....\......\..\TEST-UP4.txt,483,2014-07-11
.....\......\..\TEST-UP5.txt,810,2014-07-11
.....\......\上涨
.....\......\....\11.txt,664,2014-07-09
.....\......\....\13.txt,1127,2014-07-09
.....\......\....\9.txt,582,2014-07-09
.....\......\....\haha.txt,1204,2014-07-09
.....\......\....\TEST-DOWN2.txt,636,2014-07-11
.....\......\....\TEST-DOWN4.txt,524,2014-07-11
.....\......\....\TEST-UP2.txt,206,2014-07-11
.....\......\....\TEST-UP4.txt,483,2014-07-11
.....\......\....\TEST-UP5.txt,810,2014-07-11
.....\......\....\新建文本文档 (2).txt,1086,2014-07-09
.....\......\....\新建文本文档 (3).txt,823,2014-07-09
.....\......\....\新建文本文档.txt,391,2014-07-09
.....\......\....\测试-上涨2.txt,206,2014-07-11
.....\......\....\测试-上涨4.txt,483,2014-07-11
.....\......\....\测试-上涨5.txt,810,2014-07-11
.....\......\....\测试-下跌2.txt,636,2014-07-11
.....\......\....\测试-下跌4.txt,524,2014-07-11
.....\......\....\测试.txt,431,2014-07-09
.....\......\下跌
.....\......\....\10.txt,1163,2014-07-09
.....\......\....\12.txt,852,2014-07-09
.....\......\....\2014年5月.txt,3405,2014-07-10
.....\......\....\2014年六月.txt,3405,2014-07-10
.....\......\....\8.txt,389,2014-07-09
.....\......\....\haha.txt,1204,2014-07-09
.....\......\....\TEST-DOWN1.txt,1259,2014-07-11
.....\......\....\TEST-DOWN3.txt,601,2014-07-11
.....\......\....\TEST-DOWN5.txt,575,2014-07-11
.....\......\....\TEST-UP1.txt,1302,2014-07-11
.....\......\....\TEST-UP3.txt,677,2014-07-11
.....\......\....\新建文本文档 (2).txt,1086,2014-07-09
.....\......\....\新建文本文档 (3).txt,823,2014-07-09
.....\......\....\新建文本文档 (4).txt,743,2014-07-09
.....\......\....\新建文本文档 (5).txt,838,2014-07-09
.....\......\....\新建文本文档.txt,391,2014-07-09
.....\......\....\测试-上涨1.txt,1302,2014-07-11
.....\......\....\测试-上涨3.txt,677,2014-07-11
.....\......\....\测试-下跌1.txt,1259,2014-07-11
.....\......\....\测试-下跌3.txt,601,2014-07-11
.....\......\....\测试-下跌5.txt,575,2014-07-11
.....\......\....\测试.txt,1259,2014-07-11
.....\......\交通
.....\......\....\交 (1).txt,800,2011-04-01
.....\......\....\交 (10).txt,743,2011-04-01
.....\......\....\交 (11).txt,601,2011-04-01
.....\......\....\交 (12).txt,1484,2014-07-09
.....\......\....\交 (13).txt,415,2011-04-01
.....\......\....\交 (15).txt,2847,2011-04-01
.....\......\....\交 (16).txt,831,2011-04-01
.....\......\....\交 (17).txt,1532,2014-07-09
.....\......\....\交 (18).txt,1796,2011-04-01
.....\......\....\交 (19).txt,537,2011-04-01
.....\......\....\交 (2).txt,676,2011-04-01
.....\......\....\交 (22).txt,4758,2011-04-01
.....\......\....\交 (23).txt,5040,2011-04-01
.....\......\....\交 (25).txt,351,2014-07-09
.....\......\....\交 (26).txt,351,2014-07-09
.....\......\....\交 (27).txt,361,2014-07-09
.....\......\....\交 (29).txt,1765,2011-04-01
.....\......\....\交 (3).txt,1411,2011-04-01