SGI MLC++ Utilities README file. November 1997 Machine Learning Library in C++. http://www.sgi.com/Technology/mlc Keywords: machine learning, accuracy estimation, cross-validation, bootstrap, ID3, decision trees, decision graphs, naive-bayes, decision tables, majority, induction algorithms, classifiers, categorizers, general logic diagrams, instance-based algorithms, discretization, lazy learning, bagging, MineSet. MLC++ Team: The MLC++ team is continuing to work on research and development of machine learning techniques for data mining. The team is now working at Silicon Graphics, as part of the MineSet project for data mining and visualization: http://mineset.sgi.com. Core team members include: Ronny Kohavi (manager) Cliff Brunk Alex Kozlov Clay Kunz Dan Sommerfield Eric Eros (contractor). Questions or help requests related to the utilities should be addressed to mlc@postofc.corp.sgi.com Please see the MLC++ home page first: http://www.sgi.com/Technology/mlc ______________________________________________________________________ Quick starter: The MLC++ utilities are accessible through our web page http://www.sgi.com/Technology/mlc cd gzcat kit-name | tar xvf - (or "tar xvf -" if you got have the .tar file) where is the directory under which the mlc directory will be built (e.g., /usr/local or your own subdir). The documentation is in utils.ps and it is currently OUT OF DATE. Expect an update in the next month. Environment varibles: MLCDIR must be set to the directory where the utilities are installed. MLCPATH to the directory where the databases are stored (can be colon-separated paths). On SGI, the LD_LIBRARY_PATH must include the directory where the .so files reside (dynamically shared) unless they are in /usr/lib or /lib. Example setup file (can be sourced in .login). setenv MLCDIR "/usr/local/mlc" setenv LD_LIBRARY_PATH "/usr/lib:/lib:$MLCDIR" setenv MLCPATH ".:$MLCDIR/db" If you have not "registered" through the web page, please do so at http://www.sgi.com/Technology/mlc/mail.html Databases in the MLC++ format, which is very similar to C4.5 format can be found in http://www.sgi.com/Technology/mlc/db/ Most datafiles are converted from the repository at UC Irvine. ______________________________________________________________________