7.1 Introduction

Generalized linear models (GLM) extend the concept of the well understood linear regression model. The linear model assumes that the conditional expectation of

(the dependent or response variable) is equal to a linear combination $\boldsymbol{X}^\top\boldsymbol{\beta}$ , i.e.

Example 1 (Bernoulli responses)
Let us illustrate a binary response model (Bernoulli ) using a sample on credit worthiness. For each individual in the sample we know if the granted loan has defaulted or not. The responses are coded as

$\displaystyle Y=\left\{\begin{array}{ll} 1 & \quad \textrm{loan defaults},\\ [-1mm] 0 & \quad \textrm{otherwise}. \end{array}\right.$

The term of interest is how credit worthiness depends on observable individual characteristics $\boldsymbol {X}$ (age, amount and duration of loan, employment, purpose of loan, etc.). Recall that for a Bernoulli variable $P(Y=1\vert\boldsymbol{X})=E(Y\vert\boldsymbol{X})$ holds. Hence, the default probability $P(Y=1\vert\boldsymbol{X})$ equals a regression of on $\boldsymbol {X}$ . A useful approach is the following logit model:

$\displaystyle P(Y=1\vert\boldsymbol{X}=\boldsymbol{x})= \frac{1}% {1+\exp(-\boldsymbol{x}^\top \boldsymbol{\beta})}.$

Here the function of interest $E(Y\vert\boldsymbol{X})$ is linked to a linear function of the explanatory variables by the logistic cumulative distribution function (cdf) $F(u)=1/(1+e^{-u})=e^u/(1+e^u)$ .

The term generalized linear models (GLM) goes back to [29] and [27] who show that if the distribution of the dependent variable

is a member of the exponential family, then the class of models which connects the expectation of

to a linear combination of the variables $\boldsymbol{X}^\top\boldsymbol{\beta}$ can be treated in a unified way. In the following sections we denote the function which relates $\mu=E(Y\vert\boldsymbol{X})$ and $\eta=\boldsymbol{X}^\top \boldsymbol{\beta}$ by $\eta=G(\mu)$ or