-
Notifications
You must be signed in to change notification settings - Fork 4
Mixgene plugins
Preliminary proposal of plugin specifications.
NOTE: legacy implementation: ML_* at http://ida.felk.cvut.cz/svn/research/inspol05-09/pathways/blb/plugins/
The all ML methods are implemented by Weka in the old Xgene app.
Is it possible to define a general way how to pass arguments into the the ML methods? Xgene was based on Weka; therefore, there was a way to pass arguments as a simple text. This feature is meant to allow pass an undefined argument directly into a ML method. Maybe we should limit our ML methods to Orange library only...
Current implementation: R-mixGENE (no kernel choice yet)
options:
- kernel
- polynomial
- degree
- c
- RBF
- polynomial
results:
- model, (acc)
(Alternatively the version designed for gene expression data. There a binary package for obsolete version of R.)
Existing implementation: scikit-learn
options:
- m number of features used for a tree learning
results:
- proximities
- out-of-bag error estimate
- variable importance
- model, (acc)
Existing implementation: http://scikit-learn.org/stable/modules/tree.html or http://scaron.info/pydtl/
options:
results: model, visualized tree, (acc)
Existing implementation: http://scikit-learn.org
options:
- k
- metrics
- distance weight
results:
- model, visualized tree, (acc)
Current mplementation: R-miXGENE
Options:
- statistical correction method
Results: -
R Example: here
Options:
- statistical correction method
Results: -
Ref: A.M.Molinaro,R.Simon,and R.M. Pfeiffer, “Prediction error estimation: a comparison of resampling methods,” Bioin- formatics, vol. 21, no. 15, pp. 3301–3307, 2005.
Implementation:
options:
- s :split percentage
results: EStr, EStt
Implementation:
options:
results:
Implementation: http://scikit-learn.org/
options:
- k :number of folds
FIXME: results:
Implementation: R
options: scale, center
results: —
Implementation:
options:
results:
- boxplot
- statistics
- min, mean, median, max
Ref: Saeys et al., A review of feature selection techniques in bioinformatics. Bioinformatics, 2007
FIXME: How do deal with the zero-variance problem?
legacy code: e134 experiment
Implementation: R
options:
- kernel
- polynomial
- degree
- c
- polynomial
- Reduction of computational cost:
- remove features by batches (more than one feature in one iteration)
- With remembering order of features.
- Returns rank of the all features.
- Without remembering order of features.
- Returns rank of top-n features
- With remembering order of features.
- remove features by batches (more than one feature in one iteration)
results: ES, ranking
Implementation:
Options:
Results: ES
Legacy implementation:
- http://ida.felk.cvut.cz/svn/research/inspol05-09/pathways/blb/plugins/ge_norm/
- http://ida.felk.cvut.cz/svn/xgene/workmess/plugins/Normalization_Quantile/
Implementation: R
options: -
results: ES
Implementation:
options: -
results: ES