Statistics

115 readers
1 users here now

See also [email protected]

founded 4 years ago
MODERATORS
1
 
 

One of the most important decisions you face as a forecaster is simply when to publish a statistical model for public consumption. If you’re just running a model for your personal edification — or to make bets with — the threshold may actually be lower. If you’re evaluating the impact of a player injury on an NFL or NBA game that you’re considering betting on, for instance, then you might only get a couple of minutes before some reasonably rational assessment of the impact has already been priced into prevailing betting lines. Under these circumstances, a good first-pass estimate can go a long way. By the time you dot all the ‘i’s and cross all the ‘t’s to incorporate the impact of the injury into a formal model, it may be too late.

When you issue a statistical forecast publicly, though, I think the responsibility is slightly greater. In some cases, probabilistic forecasts can be confusing to people. And in other circumstances, people can take statistical models too seriously and treat them as oracular when in fact all models rely on the researcher’s assumptions. Let’s not get too carried away with this — some assumptions are better than others, which is why some models are better than others. (And putting a model behind a paywall is a pretty useful trick for self-selecting a more knowledgeable reader base.) But there are times when a subjective estimate may be better, especially in unforeseen circumstances that your model wasn’t really designed to handle.

For instance, when Joe Biden dropped out of the presidential race last Sunday, I suppose we could have just done a hot swap and immediately replaced him with Kamala Harris — pollsters have periodically tested the Harris vs. Trump matchup, especially since Biden’s disastrous debate on June 27. But I think this would have misinformed even our smart, self-selected group of Silver Bulletin readers more than it informed them. The polls were already in flux, given Biden’s mounting crisis on top of the assassination attempt against Trump on top of the Republican convention, which is typically a period when polls can produce short-lived bounces. And Harris’s candidacy was still hypothetical, although she was clearly prepared, working behind the scenes to become the Democrats’ presumptive nominee within 24-48 hours.

2
3
 
 

Hey, new to the Fediverse and thought I would put this out there.

If you teach stats or use stats in your job, you might be interested in using my GUI for R. I hope this doesn't count as an ad, since it is free.

I like how versatile R is, but I hate remembering how to type in what I already know it can do.

So I wrote an app.

You can tell I am not a programmer or a computer scientist (I am an engineer, industrial researcher and occasional teacher) but it works to do the basic to the moderate stats you probably use all the time.

It is free. I just put some of my blog posts and books on the splash page. If you register I only send you info about the app.

Let me know if you would like any enhancements. I continue to add stuff to it until I can do most of what the practical researcher would need.

4
 
 
5
 
 

cross-posted from: https://lemmy.ml/post/561839

On the X axis is the relative size of the public sector in each country, as a percentage. On the Y axis, PPP-adjusted GDP per capita, also adjusted by a function I made that converts mean into median income given the Gini index. This function can be approximated as Value*(1 - Gini/0.9).

6
7
 
 

Video in which Matt Parker explains Benford's Law and why it cannot be applied to elections