Category Sin categoría

Sin categoría, Statistics

The lying cook

The binomial distribution is used when we want to calculate the probability of obtaining a certain number of successes in a series of Bernoulli trials, assuming we already know the probability of success in each trial. In contrast, the beta distribution is used in the opposite situation: we have observed a given number of successes and failures, and we want to estimate how likely each possible value of the success probability is. In other words, it allows us to update our beliefs about that probability based on the data we have collected.

Manuel Molina
07/01/2025

Machine learning, Sin categoría, Statistics

The sympathy of pendulums

The rationale for minimizing the sum of squared errors in linear regression, which is often presented as a simple choice of convenience, is discussed. A probabilistic perspective suggests that the least squares equation arises naturally from assuming that the model's residuals follow a normal distribution.

Manuel Molina
06/10/2025

Sin categoría, Statistics

The tribulations of an astronaut

Binary logistic regression uses the sigmoid function to estimate the probability of the target variable when it is binary. However, this function does not allow direct probability estimates when dealing with nominal variables with more than two categories. In these cases, we will use multinomial logistic regression, which will use the softmax function to estimate the probabilities with respect to a reference category.

Manuel Molina
05/20/2025

Sin categoría, Statistics

The mystery of the imperfect crime

The tau-squared represents the variability of effects between the different populations from which the primary studies of a systematic review are derived, according to the assumption of the random effects model of meta-analysis. Its usefulness for weighting studies and for calculating prediction intervals is described, understanding how its significance goes beyond being a mere indicator of heterogeneity.

Manuel Molina
04/29/2025

Sin categoría, Statistics

Between preferences and coincidences

Cramer's V allows the strength of the association between two categorical (nominal) variables, not ordinal, to be quantified. It is especially useful when the variables have multiple categories, since it allows the strength of the association to be condensed into a single figure. Its values range from 0, no association, to 1, a perfect association.

Manuel Molina
04/08/2025