Navigation
  • Home
  • Recent
  • Most Active
  • Popular
  • Blog
  • Credits
  • RSS
  •   Interaction
  • Register
  • Statistics
  •   Help
  • Suggestions
  • Contact Us
  • How to Edit
  • Help



  • [Edit]


    In statistics, regression analysis is used to model relationships between variables, determine the magnitude of the relationships between variables, and can be used to make predictions based on the models.

        Regression analysis
            Introduction
                Simple and multiple linear regression
                Nonlinear regression models
                Other models
            Linear models
            Regression and Bayesian statistics
            Examples
                Prediction of future observations
            See also
            Software

    top

    Introduction
    Regression analysis models the relationship between one or more response variables (also called dependent variables, explained variables, predicted variables, or regressands) (usually named Y), and the predictors (also called independent variables, explanatory variables, control variables, or regressors,) usually named X_1, ..., X_p).
    Multivariate regression describes models that have more than one response variable.

    top

    Simple and multiple linear regression
    Simple linear regression and multiple linear regression are related statistical methods for modeling the relationship between two or more random variables using a linear equation. Simple linear regression refers to a regression on two variables while multiple regression refers to a regression on more than two variables. Linear regression assumes the best estimate of the response is a linear function of some parameters (though not necessarily linear on the predictors).

    top

    Nonlinear regression models
    If the relationship between the variables being analyzed is not linear in parameters, a number of nonlinear regression techniques may be used to obtain a more accurate regression.

    top

    Other models
    Although these three types are the most common, there also exist Poisson regression, supervised learning, and unit-weighted regression.

    top

    Linear models

    Predictor variables may be defined quantitatively (i.e., continuous) or qualitatively (i.e., categorical). Categorical predictors are sometimes called factors. Although the method of estimating the model is the same for each case, different situations are sometimes known by different names for historical reasons:


    The linear model usually assumes that the dependent variable is continuous. If least squares estimation is used, then if it is assumed that the error component is normally distributed, the model is fully parametric. If it is not assumed that the data are normally distributed, the model is semi-parametric. If the data are not normally distributed, there are often better approaches to fitting than least squares. In particular, if the data contain outliers, robust regression might be preferred.

    If two or more independent variables are correlated, we say that the variables are multicollinear. Multicollinearity results in parameter estimates that are unbiased and consistent, but which may have relatively large variances.

    If the regression error is not normally distributed but is assumed to come from an exponential family, generalized linear models should be used. For example, if the response variable can take only binary values (for example, a Boolean or Yes/No variable), logistic regression is preferred. The outcome of this type of regression is a function which describes how the probability of a given event (e.g. probability of getting "yes") varies with the predictors.

    top

    Regression and Bayesian statistics
    Maximum likelihood is one method of estimating the parameters of a regression model, which behaves well for large samples. However, for small amounts of data, the estimates can have high variance or bias. Bayesian methods can also be used to estimate regression models. A prior is placed over the parameters, which incorporates everything known about the parameters. (For example, if one parameter is known to be non-negative, a non-negative distribution can be assigned to it.) A posterior distribution is then obtained for the parameter vector. Bayesian methods have the advantages that they use all the information that is available. They are exact, not asymptotic, and thus work well for small data sets if some contextual information is available to be used in the prior. Some practitioners use maximum a posteriori (MAP) methods, a simpler method than full Bayesian analysis, in which the parameters are chosen that maximize the posterior mode. MAP methods are related to Occam's Razor: there is a preference for simplicity among a family of regression models (curves) just as there is a preference for simplicity among competing theories.

    top

    Examples
    To illustrate the various goals of regression, we will give three examples.

    top

    Prediction of future observations
    The following data set gives the average heights and weights for American women aged 30-39 (source: The World Almanac and Book of Facts, 1975).



    We would like to see how the weight of these women depends on their height. We are therefore looking for a function eta such that Y=eta(X)+varepsilon, where Y is the weight of the women and X their height. Intuitively, we can guess that if the women's proportions are constant and their density too, then the weight of the women must depend on the cube of their height. A plot of the data set confirms this supposition:



    vec will denote the vector containing all the measured heights (vec=(58,59,60,cdots)) and vec=(115,117,120,cdots) is the vector containing all measured weights. We can suppose the heights of the women are independent from each other and have constant variance, which means the Gauss-Markov assumptions hold. We can therefore use the least-squares estimator, i.e. we are looking for coefficients heta^0, heta^1 and heta^2 satisfying as well as possible (in the sense of the least-squares estimator) the equation:

    vec= heta^0 + heta^1 vec + heta^2 vec^3+vec


    Geometrically, what we will be doing is an orthogonal projection of Y on the subspace generated by the variables 1, X and X^3. The matrix X is constructed simply by putting a first column of 1's (the constant term in the model) a column with the original values (the X in the model) and a third column with these values cubed (X^3). The realization of this matrix (i.e. for the data at hand) can be written:




    The matrix (mathbf^t mathbf)^ (sometimes called "information matrix" or "dispersion matrix") is:


    leftegin{matrix} 1.9cdot10^3&-45&3.5cdot 10^{-3}\ -45&1.0&-8.1cdot 10^{-5}\ 3.5cdot 10^{-3}&-8.1cdot 10^{-5}&6.4cdot 10^{-9} end{matrix} ight

    Vector widehat_ is therefore:

    widehat_=(X^tX)^X^y= (147, -2.0, 4.3cdot 10^)

    hence eta(X) = 147 - 2.0 X + 4.3cdot 10^ X^3

    A plot of this function shows that it lies quite closely to the data set:


    The confidence intervals are computed using:

    widehat{ heta_j}-widehat{sigma}sqrt{s_j}t_{n-p;1- rac{alpha}{2}};widehat{ heta_j}+widehat{sigma}sqrt{s_j}t_{n-p;1- rac{alpha}{2}}


    with:

    widehat=0.52

    s_1=1.cdot 10^3, s_2=1.0, s_3=6.4cdot 10^;

    alpha=5%

    t_=2.2


    Therefore, we can say that the 95% confidence intervals are:

    heta^0in112 , 181


    heta^1in-2.8 , -1.2


    heta^2in3.6cdot 10^{-4} , 4.9cdot 10^{-4}


    top

    See also

    top

    Software
      There are many minor softwares specialized in a niche form of regression
     
    Search more:
     

       
    Source Privacy License Download Contact Us Atlas
    Scientus.org Dictionary (Yet Another Wiki) RC : 1.39
    This article is licensed under the GNU Free Documentation License [copyleft]. It uses material from the Wikipedia article "Regression analysis". link