Computing the standard normal cumulative distribution with R

I recently had to compute the Bayes error for a Gaussian model.
Let x_i be a predictive variable, following a Gaussian distribution of variance 1 and mean T_i in a class, and -T_i in the other class.
The Bayes error of the model is e=1-Φ(sqrt(sum(T_i^2)).
In R, the standard normal cumulative distribution function is computed using pnorm().

So, here’s the Bayes error if for instance we have 10 variables with mean 0.5 in a class and -0.5 in the other class:
Bayes_error=1-pnorm(1*sqrt(10*0.5^2));

If we had more complicated things, like 5 variables with mean 0.5,0.6,0.7,0.8,0.9 in a class and -0.5,-0.6,-0.7,-0.8,-0.9 in the other, we could do something like:
mus=c(0.5,0.6,0.7,0.8,0.9); Bayes_error=1-pnorm(1*sqrt(t(mus)%*%mus));
or not vectorized: Bayes_error=1-pnorm(1*sqrt(sum(mus*mus)));

Edit: I just found again about this post through my statistics panel, and I really have no idea why I titled it “computing the std normal cumulative distrib”. It should rather be “computing the Bayes error”… Leaving it this way in order not to mess with the established pretty URL, though.

Posted in R (R-project), statistics.

rev="post-2414" No comments

By patheticcockroach – 2011-11-21

0 Responses

Stay in touch with the conversation, subscribe to the RSS feed for comments on this post.

« VisualGPG 0.1.2 R: removing the last elements of a vector the easy way »

M	T	W	T	F	S	S
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30

Computing the standard normal cumulative distribution with R

0 Responses

See also…

Recent Comments

Meta

Calendar

Archives

Computing the standard normal cumulative distribution with R

0 Responses

Subscribe

See also…

Recent Comments

Meta

Calendar

Archives