英语翻译-
来源:学生作业帮助网 编辑:六六作业网 时间:2024/12/26 18:01:57
英语翻译-
英语翻译
-
英语翻译-
project name: A text classifier based on Naive Bayes Algorithm
Time: 2010/11 - 2010/12
Tools: MyEclipse8.5 + Python 2.5 + Weka3.6 + Lucene + IKAnalyzer
Project Description:
This is a Chinese text classification system that I completed when I was learning machine learning and data mining. The System is divided into two parts, the first part is the emails classification, the second part is articles classification.In the part of emails classification, the system can filter out spam after training. In the part of articles classification, the system is able to post each article in the collection to the relevant category after training.
Duty Description:
Using python to implement a simple crawler to crawl on the articles on from the Internet. Realizing the Naive Bayes Algorithm of the part of emails classification. Designing the framework of the system and realizing the relevant source code for the parts of Chinese Word Segmentation, building the Bayesian model and text categorization.
Project name: based on bayesian algorithm text classifier
Time: 2010 November - December 2010
Development tools: MyEclipse8.5 + Python 2.5 + Weka3.6 + Lucene + IKAnalyzer
Project descri...
全部展开
Project name: based on bayesian algorithm text classifier
Time: 2010 November - December 2010
Development tools: MyEclipse8.5 + Python 2.5 + Weka3.6 + Lucene + IKAnalyzer
Project description:
This system is I'm learning machine learning and data mining, and implements a when Chinese text classification procedures. System is divided into two parts, the first part is the mail classification, the 2nd part is article categories. In part, the system through email classification training, can from mail collection filter out spam. In part, the system through the classification, to the training of the collection of lu allocated to the relevant categories.
Responsibility description:
Implements a simple Python crawler used to grab Web articles, realized the mail plain bayes classification part classification algorithm, the design framework of the classifier, realized the Chinese word segmentation part, establish bayesian model part, text classification part of the relevant source.
正确の!
收起
Project name: based on bayesian algorithm text classifier
Time: 2010 November - December 2010
Development tools: MyEclipse8.5 + Python 2.5 + Weka3.6 + Lucene + IKAnalyzer
Project descri...
全部展开
Project name: based on bayesian algorithm text classifier
Time: 2010 November - December 2010
Development tools: MyEclipse8.5 + Python 2.5 + Weka3.6 + Lucene + IKAnalyzer
Project description:
This system is I'm learning machine learning and data mining, and implements a when Chinese text classification procedures. System is divided into two parts, the first part is the mail classification, the 2nd part is article categories. In part, the system through email classification training, can from mail collection filter out spam. In part, the system through the classification, to the training of the collection of lu allocated to the relevant categories.
Responsibility description:
Implements a simple Python crawler used to grab Web articles, realized the mail plain bayes classification part classification algorithm, the design framework of the classifier, realized the Chinese word segmentation part, establish bayesian model part, text classification part of the relevant source.
收起