Navigation
  • Home
  • Recent
  • Most Active
  • Popular
  • Blog
  • Credits
  • RSS
  •   Interaction
  • Register
  • Statistics
  •   Help
  • Suggestions
  • Contact Us
  • How to Edit
  • Help



  • [Edit]



    As a broad subfield of artificial intelligence, Machine learning is concerned with the development of algorithms and techniques that allow computers to "learn". At a general level, there are two types of learning: inductive, and deductive. Inductive machine learning methods create computer programs by extracting rules and patterns out of massive data sets. It should be noted that although pattern identification is important to Machine Learning, without rule extraction a process falls more accurately in the field of data mining.

    Machine learning overlaps heavily with statistics. In fact, many machine learning algorithms
    have been found to have direct counterparts with statistics. For example, boosting is
    now widely thought to be a form of stagewise regression using a specific type of loss function.

    Machine learning has a wide spectrum of applications including natural language processing, search engines, medical diagnosis, bioinformatics and cheminformatics, detecting credit card fraud, stock market analysis, classifying DNA sequences, speech and handwriting recognition, object recognition in computer vision, game playing and robot locomotion.


        Machine learning
            Human interaction
            Algorithm types
            Machine learning topics
            See also
            Bibliography
                General resources
                Journals and Conferences
                Research groups
                Software

    top

    Human interaction

    Some machine learning systems attempt to eliminate the need for human intuition in the analysis of the data, while others adopt a collaborative approach between human and machine. Human intuition cannot be entirely eliminated since the designer of the system must specify how the data are to be represented and what mechanisms will be used to search for a characterization of the data. Machine learning can be viewed as an attempt to automate parts of the scientific method. Some machine learning researchers create methods within the framework of Bayesian statistics.

    top

    Algorithm types

    Machine learning algorithms are organized into a taxonomy, based on the desired outcome of the algorithm. Common algorithm types include:

      supervised learning --- where the algorithm generates a function that maps inputs to desired outputs. One standard formulation of the supervised learning task is the classification problem: the learner is required to learn (to approximate the behavior of) a function which maps a vector X_1, X_2, ldots X_N, into one of several classes by looking at several input-output examples of the function.
      semi-supervised learning --- which combines both labeled and unlabeled examples to generate an appropriate function or classifier.
      reinforcement learning --- where the algorithm learns a policy of how to act given an observation of the world. Every action has some impact in the environment, and the environment provides feedback that guides the learning algorithm.
      transduction --- similar to supervised learning, but does not explicitly construct a function: instead, tries to predict new outputs based on training inputs, training outputs, and new inputs.

    The performance and computational analysis of machine learning algorithms is a branch of theoretical computer science known as computational learning theory.

    top

    Machine learning topics

    This list represents the topics covered on a typical machine learning course.

      Inductive Transfer and Learning to Learn
      Approximate inference techniques:
      Meta-Learning (Ensemble methods):
      Optimization: most of methods listed above either use optimization or are instances of optimization algorithms.
      Multi-objective Machine Learning: An approach that addresses multiple, and often confliciting learning objectives explicitly using Pareto-based multi-objective optimization techniques.

    top

    See also

    top

    Bibliography
      Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell (1983), Machine Learning: An Artificial Intelligence Approach, Tioga Publishing Company, ISBN 0-935382-05-4
      Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell (1986), Machine Learning: An Artificial Intelligence Approach, Volume II, Morgan Kaufmann, ISBN 0-934613-00-1
      Yves Kodratoff, Ryszard S. Michalski (1990), Machine Learning: An Artificial Intelligence Approach, Volume III, Morgan Kaufmann, ISBN 1-55860-119-8
      Ryszard S. Michalski, George Tecuci (1994), Machine Learning: A Multistrategy Approach, Volume IV, Morgan Kaufmann, ISBN 1-55860-251-8
      Bhagat, P. M. (2005). Pattern Recognition in Industry, Elsevier. ISBN 0-08-044538-1
      Bishop, C. M. (1995). Neural Networks for Pattern Recognition, Oxford University Press. ISBN 0-19-853864-2
      Richard O. Duda, Peter E. Hart, David G. Stork (2001) Pattern classification (2nd edition), Wiley, New York, ISBN 0-471-05669-3
      Huang T.-M., Kecman V., Kopriva I. (2006), Kernel Based Algorithms for Mining Huge Data Sets, Supervised, Semi-supervised, and Unsupervised Learning, Springer-Verlag, Berlin, Heidelberg, 260 pp. 96 illus., Hardcover, ISBN 3-540-31681-7*
      KECMAN Vojislav (2001), LEARNING AND SOFT COMPUTING, Support Vector Machines, Neural Networks and Fuzzy Logic Models, The MIT Press, Cambridge, MA, 608 pp., 268 illus., ISBN 0-262-11255-8*
      Mitchell, T. (1997). Machine Learning, McGraw Hill. ISBN 0-07-042807-7
      Sholom Weiss and Casimir Kulikowski (1991). Computer Systems That Learn, Morgan Kaufmann. ISBN 1-55860-065-5

    top

    General resources
      MLpedia – wiki dedicated to machine learning.

    top

    Journals and Conferences

    top

    Research groups

    top

    Software
      PRTools is another complete package similar to SPIDER and implemented in MATLAB. SPIDER seems to have more native support and functions for kernel methods, but PRTools has a slightly larger variety of other machine learning tools. PRTools has an accompanying textbook and much better documentation. Both SPIDER and PRTools are available freely for non-commercial applications.
      Orange is a machine learning suite with Python scripting and a visual programming interface.
      Weka Machine Learning Software providing machine learning operators for pattern classification, regression, clustering, association rule learning, and meta-operators like e.g. ensemble learners.
      MATLAB, by The MathWorks, has toolbox support for many machine learning tools. The Bioinformatics toolbox includes Support Vector Machines and KNN classifiers. The Statistics toolbox includes linear discriminant and decision tree classification. The Neural Network toolbox is a complete set of tools for implementing Neural Networks (PRTools relies on it for its neural network classifiers). New methods for classifier performance evaluation and cross validation make MATLAB more attractive for machine learning.
      Synapse by Peltarion supports the development of a wide range of machine learning systems and the integration of different types of machine learning into hybrid systems.
      MLC++ is a library of C++ classes for supervised machine learning
      questsin an Add-In for Microsoft Excel, that uses machine learning to expand your selection similar to the Popular Fill Data Feature.
      * SemiL is the world first efficient software for solving large scale semi-supervised learning or transductive inference problems using graph based approaches when faced with unlabeled data. It implements various semisupervised learning approaches.
      PCP is a free program for feature selection and supervised pattern classification, written in C. Supports interactive and batch modes.
      AQ21 program seeks different types of patterns in data and represents them in human-oriented forms resembling natural language descriptions. It integrates several novel abilities such as to discover different types of attributional patterns depending on the parameter settings, to optimize patterns according to a large number of different pattern quality criteria, to learn rules with exceptions, to determine optimized sets of alternative hypotheses generalizing the same data, and to handle data with missing, irrelevant and/or not-applicable meta-values.
      iAQ program demonstrates Natural Induction, that is, an ability of a computer program to learn knowledge from data in forms natural to people, and by that easy to understand and interpret. In iAQ, discovered rules are expressed verbally and also as natural language text.
      LEM3 system implements a novel, non-Darwinian methodology for evolutionary computation, called Learnable Evolution Model or LEM. LEM employs a learning program to guide the evolutionary computation. Instead of conventional random mutations and recombinations, LEM employs hypothesis formation and generation operators to create new populations of individuals. LEM3 can handle very complex, non-linear and multi-mode optimization problems with hundreds of controlable multi-type variables, and is particularly advantageous for problems in which the computation of the evaluation function (fitness function) is costly or time-consuming.
      SNoW is a learning architecture that is tailored for learning in the presence of a very large number of features. SNoW learns linear functions via regularized variations of Perceptron and Winnow. The packaage contains a large number of options and also a good sparse implementation of naive Bayes.




     
    Search more:
     

       
    Source Privacy License Download Contact Us Atlas
    Scientus.org Dictionary (Yet Another Wiki) RC : 1.39
    This article is licensed under the GNU Free Documentation License [copyleft]. It uses material from the Wikipedia article "Machine learning". link