weka控制台指令

java weka.classifiers.trees.J48 -t data/weather.arff

java 类的完整名称 -t表示下一个参数是训练数据集的名称

java weka.classifiers.trees.J48 -h

查看java命令行中各个参数的具体含义

-h or -help
    Output help information.
-synopsis or -info
    Output synopsis for classifier (use in conjunction  with -h)
-t <name of training file>
    Sets training file.
-T <name of test file>
    Sets test file. If missing, a cross-validation will be performed
    on the training data.
-c <class index>
    Sets index of class attribute (default: last).
-x <number of folds>
    Sets number of folds for cross-validation (default: 10).
-no-cv
    Do not perform any cross validation.
-force-batch-training
    Always train classifier in batch mode, never incrementally.
-split-percentage <percentage>
    Sets the percentage for the train/test set split, e.g., 66.
-preserve-order
    Preserves the order in the percentage split.
-s <random number seed>
    Sets random number seed for cross-validation or percentage split
    (default: 1).
-m <name of file with cost matrix>
    Sets file with cost matrix.
-disable <comma-separated list of evaluation metric names>
    Comma separated list of metric names not to print to the output.
    Available metrics:
    Correct,Incorrect,Kappa,Total cost,Average cost,KB relative,KB information,
    Correlation,Complexity 0,Complexity scheme,Complexity improvement,
    MAE,RMSE,RAE,RRSE,Coverage,Region size,TP rate,FP rate,Precision,Recall,
    F-measure,MCC,ROC area,PRC area
-l <name of input file>
    Sets model input file. In case the filename ends with ‘.xml‘,
    a PMML file is loaded or, if that fails, options are loaded
    from the XML file.
-d <name of output file>
    Sets model output file. In case the filename ends with ‘.xml‘,
    only the options are saved to the XML file, not the model.
-v
    Outputs no statistics for training data.
-o
    Outputs statistics only, not the classifier.
-i
    Outputs detailed information-retrieval statistics for each class.
-k
    Outputs information-theoretic statistics.
-classifications "weka.classifiers.evaluation.output.prediction.AbstractOutput + options"
    Uses the specified class for generating the classification output.
    E.g.: weka.classifiers.evaluation.output.prediction.PlainText
-p range
    Outputs predictions for test instances (or the train instances if
    no test instances provided and -no-cv is used), along with the
    attributes in the specified range (and nothing else).
    Use ‘-p 0‘ if no attributes are desired.
    Deprecated: use "-classifications ..." instead.
-distribution
    Outputs the distribution instead of only the prediction
    in conjunction with the ‘-p‘ option (only nominal classes).
    Deprecated: use "-classifications ..." instead.
-r
    Only outputs cumulative margin distribution.
-z <class name>
    Only outputs the source representation of the classifier,
    giving it the supplied name.
-g
    Only outputs the graph representation of the classifier.
-xml filename | xml-string
    Retrieves the options from the XML-data instead of the command line.
-threshold-file <file>
    The file to save the threshold data to.
    The format is determined by the extensions, e.g., ‘.arff‘ for ARFF
    format or ‘.csv‘ for CSV.
-threshold-label <label>
    The class label to determine the threshold data for
    (default is the first label)

Options specific to weka.classifiers.trees.J48:

-U
    Use unpruned tree.
-O
    Do not collapse tree.
-C <pruning confidence>
    Set confidence threshold for pruning.
    (default 0.25)
-M <minimum number of instances>
    Set minimum number of instances per leaf.
    (default 2)
-R
    Use reduced error pruning.
-N <number of folds>
    Set number of folds for reduced error
    pruning. One fold is used as pruning set.
    (default 3)
-B
    Use binary splits only.
-S
    Don‘t perform subtree raising.
-L
    Do not clean up after the tree has been built.
-A
    Laplace smoothing for predicted probabilities.
-J
    Do not use MDL correction for info gain on numeric attributes.
-Q <seed>
    Seed for random data shuffling (default 1).

weka.core

weka核心包，基本所有类都与他有联系

核心包中的关键类：Attribute：包含attribute’s name, its type, and, in the case of a nominal or string attribute, its possible values

Instance：contains the attribute values of a particular instance

Instances：holds an ordered set of instances—in other words, a dataset

weka.classifiers

内容：contains implementations of most of the algorithms for clas-sification and numeric prediction

关键抽象类：Classifier---->>defines the general structure of any scheme for classification or numeric prediction

包含三个核心方法：buildClassifier(), classifyInstance(),distributionForInstance()

继承这个抽象类的例子：

weka.classifiers.trees.DecisionStump
覆写了distributionForInstance()
包含getRevision()，simply returns the revision number of the classifier，used by Weka maintainers when diagnosing and debugging problems reported by users.
包含globalInfo()，returns a string describing the classifier, which, along with the scheme’s options
包含toString()， returns a textual representation of the classifier
包含toSource()，s used to obtain a source code repre-sentation of the learned classifier
包含main()，called when you ask for a decision stump from the command line，相当于执行这个类的入口
包含getCapabilities() ，called by the generic object editor to provide information about the capabilities of a learning scheme

其他的一些比较重要的包

weka.associations

：contains association-rule learners

weka.clusterers

：contains methods for unsupervised learning.包含非监督学习方法

weka.datagenerators

：产生人工数据

weka.estimators package

：computes different types of probability distribution

weka.filters

：提供数据清理的相关方法

时间： 2024-10-07 20:32:57

weka控制台指令的相关文章

梁梦瑶 20190905-3 命令行和控制台编程

此作业的要求参见https://edu.cnblogs.com/campus/nenu/2019fall/homework/5523 1.熟悉命令行和控制台假设在当前目录下存在应用程序 a.exe 和文件文件 b., 请以数据流图并辅助以文字说明下述控制台命令的作用是什么.(5分) a.exe < b.txt > c.txt 答:文档b作为程序a的输入,结果输出到c文档里. 请用C言开发应用程序d.exe,从控制台指令读入命令行参数,并在控制台分别打印出a.b.c的值.运行效果形如下面

消息化服务自实现设计

背景一直想自己试试设计一个只属于自己的小框架,于是就有了这个项目. 代码地址 https://coding.net/u/mich/p/MiniServer/git 项目说明该项目主要将每个服务都配置一个消息队列,服务间通过消息进行传递,同时提供了控制台指令,进行服务的管理,实现一些后台的工作(主要web太麻烦了,所以控制台输入最省力),该项目只处理过一些小型后台任务,未在正式大型项目中实际使用过! 未来展望 1. 以后估计会将以往做过的微博用户数据抓取,tower对接,股票数据抓取及分析等已

WSS存储服务器（Windows Storage Server） 2012新功能解析

虽然最近一段时间有关微软的新闻大多数集中在Windows 8以及Surface平板设备身上,但数周之前Windows Server 2012新版本中所包含的Windows Storage Server 2012同样值得我们关注.这款产品分类两个版本,工作组版与标准版.工作组版本主要针对小型用户群组,授权许可中将其业务连接数量上限设置为250个.它同时也对底层硬件做出限制,要求用户使用单CPU插槽.最高32GB内存及最多六块磁盘驱动器. 其标准版则在功能与特性方面与Windows Server 2

Android08_广播接受者_服务

1 广播接受者广播的发送是通过intent开启的需要被接收的广播:电量不足,sd卡被移除,电话移除,短信到来等等 1.1 监视sd卡的状态需求:监视sd卡的状态,并提示用户受到影响的功能. 步骤:①买个收音机创建一个类,继承BroadcastReceiver类(可以看做当一个类继承了该类,如果sd卡发生状态改变会接收到一条信息?) ②装上电池在清单文件中配置 <receiver>标签(接受者) 配置属性name=”全包名”一定要配置到对应的类,不然会报错 ③调到对应频道在rece

第一次作业_文档

计算器——工程文档一．输入与格式 1.数据规模本文档的输入基于如下的要求: (1)既然是小学生,我们假设他们不会计算超过10亿的数字. (2)既然是出考试题,那么也不会出超过10亿道题目. 也就是说 n , r ∈ int; 2.格式要求: 本文档的输出格式为: (1) 当制造试题时,输出为n行,每行是算式编号.一个句号和一个空格,后面是一个算式.算式包含等号.四则运算符号和等号前后各有一个空格.其余符号则没有.答案输出为n行,每行是算式编号.一个句号和一个空格,后面是一个答案.

【C#】Winform工具-闪讯下快速分享Wifi（源码）

A.工具简介最近忙着改论文,但迫于手机没有流量,反复共享电脑wifi的操作已经让我忍无可忍,而“猎豹wifi”“360wifi分享”等工具曾导致电脑多次睡死(可能是台式机的USB网卡兼容性问题).因此抽空写了个便利工具,完全利用windows自带功能实现,主要合并了“命令提示符”的若干指令: 设置windows自带wifi热点: 开启.关闭wifi: 自动关机与取消:(主要是本人喜欢睡前趴床上再玩会手机,又懒得起身关电脑) 以及部分.NET对系统进行管理: 关闭闪讯进程(避免闪讯锁定共享功能)

Microsoft Visual C++ 6.0预处理器参考手册

返回总目录 Microsoft Visual C++ 6.0 预处理器参考手册目录引言.......................................................................................................................3 特殊术语.................................................................................

Unity与Android的对调

本文结构使用注意新工程 Unity传Android参数 Android传Unity参数 SDK嵌入 SDK封装入类 ? 使用注意安装APK,通常旧的不需要删除,安装后会被覆盖: 任何代码文件都可以使用UnityPlayer.UnitySendMessage() 向Unity发消息.只要import com.unity3d.player.UnityPlayer;然后写UnityPlayer.UnitySendMessage()即可: Android调用Android的相关方法,参数Unity

虚幻4独立服务器设置

1.安装VS2015update2任意版本(VS Community可免费官网获取),准备较大空间的磁盘位置作为工作空间(很重要,至少60G!) 2.从github获取虚幻引擎存储库链接,克隆到工作空间.需要你的Github账号与虚幻账号绑定,具体操作参考虚幻官网相关内容. 3.先后执行Setup.bat和GenerateProjectFiles.bat两个批处理文件,Setup.bat会下载很多依赖项,根据网速不同执行时间也不同,需等待执行完后窗口自动退出.完毕后得到UE4.sln,再用vs打