Monthly Archives: May 2016

3 YEARS AGO (MAY 2013): MEMORY LANE

Posted on May 30, 2016 by Mayo

3 years ago…

MONTHLY MEMORY LANE: 3 years ago: May 2013. I mark in red three posts that seem most apt for general background on key issues in this blog [1]. Some of the May 2013 posts blog the conference we held earlier that month: “Ontology and Methodology”. I highlight in burgundy a post on Birnbaum that follows up on my last post in honor of his birthday. New questions or comments can be placed on this post.

May 2013

(5/3) Schedule for Ontology & Methodology, 2013
(5/6) Professorships in Scandal?
(5/9) If it’s called the “The High Quality Research Act,” then ….
(5/13) ‘No-Shame’ Psychics Keep Their Predictions Vague: New Rejected post
(5/14) “A sense of security regarding the future of statistical science…” Anon review of Error and Inference
(5/18) Gandenberger on Ontology and Methodology (May 4) Conference: Virginia Tech
(5/19) Mayo: Meanderings on the Onto-Methodology Conference
(5/22) Mayo’s slides from the Onto-Meth conference
(5/24) Gelman sides w/ Neyman over Fisher in relation to a famous blow-up
(5/26) Schachtman: High, Higher, Highest Quality Research Act
(5/27) A.Birnbaum: Statistical Methods in Scientific Inference
(5/29) K. Staley: review of Error & Inference

[1]Monthly memory lanes began at the blog’s 3-year anniversary in Sept, 2014.

Categories: 3-year memory lane, Statistics | Leave a comment

Allan Birnbaum: Foundations of Probability and Statistics (27 May 1923 – 1 July 1976)

Posted on May 27, 2016 by Mayo

27 May 1923-1 July 1976

Today is Allan Birnbaum’s birthday. In honor of his birthday this year, I’m posting the articles in the Synthese volume that was dedicated to his memory in 1977. The editors describe it as their way of “paying homage to Professor Birnbaum’s penetrating and stimulating work on the foundations of statistics”. I paste a few snippets from the articles by Giere and Birnbaum. If you’re interested in statistical foundations, and are unfamiliar with Birnbaum, here’s a chance to catch up.(Even if you are,you may be unaware of some of these key papers.)

HAPPY BIRTHDAY ALLAN!

Synthese Volume 36, No. 1 Sept 1977: Foundations of Probability and Statistics, Part I

Editorial Introduction:

This special issue of Synthese on the foundations of probability and statistics is dedicated to the memory of Professor Allan Birnbaum. Professor Birnbaum’s essay ‘The Neyman-Pearson Theory as Decision Theory; and as Inference Theory; with a Criticism of the Lindley-Savage Argument for Bayesian Theory’ was received by the editors of Synthese in October, 1975, and a decision was made to publish a special symposium consisting of this paper together with several invited comments and related papers. The sad news about Professor Birnbaum’s death reached us in the summer of 1976, but the editorial project could nevertheless be completed according to the original plan. By publishing this special issue we wish to pay homage to Professor Birnbaum’s penetrating and stimulating work on the foundations of statistics. We are grateful to Professor Ronald Giere who wrote an introductory essay on Professor Birnbaum’s concept of statistical evidence and who compiled a list of Professor Birnbaum’s publications.

THE EDITORS

Continue reading →

Categories: Birnbaum, Error Statistics, Likelihood Principle, Statistics, strong likelihood principle | 7 Comments

Frequentstein: What’s wrong with (1 – β)/α as a measure of evidence against the null? (ii)

Posted on May 22, 2016 by Mayo

In their “Comment: A Simple Alternative to p-values,” (on the ASA P-value document), Benjamin and Berger (2016) recommend researchers report a pre-data Rejection Ratio:

It is the probability of rejection when the alternative hypothesis is true, divided by the probability of rejection when the null hypothesis is true, i.e., the ratio of the power of the experiment to the Type I error of the experiment. The rejection ratio has a straightforward interpretation as quantifying the strength of evidence about the alternative hypothesis relative to the null hypothesis conveyed by the experimental result being statistically significant. (Benjamin and Berger 2016, p. 1)

The recommendation is much more fully fleshed out in a 2016 paper by Bayarri, Benjamin, Berger, and Sellke (BBBS 2016): Rejection Odds and Rejection Ratios: A Proposal for Statistical Practice in Testing Hypotheses. Their recommendation is:

…that researchers should report the ‘pre-experimental rejection ratio’ when presenting their experimental design and researchers should report the ‘post-experimental rejection ratio’ (or Bayes factor) when presenting their experimental results. (BBBS 2016, p. 3)….

The (pre-experimental) ‘rejection ratio’ R_pre , the ratio of statistical power to significance threshold (i.e., the ratio of the probability of rejecting under H₁ and H₀ respectively), is shown to capture the strength of evidence in the experiment for H₁over H₀. (ibid., p. 2)

But in fact it does no such thing! [See my post from the FUSION conference here.] J. Berger, and his co-authors, will tell you the rejection ratio (and a variety of other measures created over the years) are entirely frequentist because they are created out of frequentist error statistical measures. But a creation built on frequentist measures doesn’t mean the resulting animal captures frequentist error statistical reasoning. It might be a kind of Frequentstein monster! [1] Continue reading →

Categories: J. Berger, power, reforming the reformers, S. Senn, Statistical power, Statistics | 36 Comments

Fallacies of Rejection, Nouvelle Cuisine, and assorted New Monsters

Posted on May 14, 2016 by Mayo

Jackie Mason

Whenever I’m in London, my criminologist friend Katrin H. and I go in search of stand-up comedy. Since it’s Saturday night (and I’m in London), we’re setting out in search of a good comedy club (I’ll complete this post upon return). A few years ago we heard Jackie Mason do his shtick, a one-man show billed as his swan song to England. It was like a repertoire of his “Greatest Hits” without a new or updated joke in the mix. Still, hearing his rants for the n^th time was often quite hilarious. It turns out that he has already been back doing another “final shtick tour” in England, but not tonight.

A sample: If you want to eat nothing, eat nouvelle cuisine. Do you know what it means? No food. The smaller the portion the more impressed people are, so long as the food’s got a fancy French name, haute cuisine. An empty plate with sauce!

As one critic wrote, Mason’s jokes “offer a window to a different era,” one whose caricatures and biases one can only hope we’ve moved beyond:

But it’s one thing for Jackie Mason to scowl at a seat in the front row and yell to the shocked audience member in his imagination, “These are jokes! They are just jokes!” and another to reprise statistical howlers, which are not jokes, to me. This blog found its reason for being partly as a place to expose, understand, and avoid them. I had earlier used this Jackie Mason opening to launch into a well-known fallacy of rejection using statistical significance tests. I’m going to go further this time around. I began by needling some leading philosophers of statistics: Continue reading →

Categories: reforming the reformers, science-wise screening, Statistical power, statistical tests, Statistics | Tags: criticism of frequentist methods, fallacy of rejection, Jackie Mason, sample size, significance tests | 5 Comments

Excerpts from S. Senn’s Letter on “Replication, p-values and Evidence”

Posted on May 12, 2016 by Mayo

I first blogged this letter here. Below the references are some more recent blog links of relevance to this issue.

Dear Reader: I am typing in some excerpts from a letter Stephen Senn shared with me in relation to my April 28, 2012 blogpost. It is a letter to the editor of Statistics in Medicine in response to S. Goodman. It contains several important points that get to the issues we’ve been discussing. You can read the full letter here. Sincerely, D. G. Mayo

STATISTICS IN MEDICINE, LETTER TO THE EDITOR

From: Stephen Senn*

Some years ago, in the pages of this journal, Goodman gave an interesting analysis of ‘replication probabilities’ of p-values. Specifically, he considered the possibility that a given experiment had produced a p-value that indicated ‘significance’ or near significance (he considered the range p=0.10 to 0.001) and then calculated the probability that a study with equal power would produce a significant result at the conventional level of significance of 0.05. He showed, for example, that given an uninformative prior, and (subsequently) a resulting p-value that was exactly 0.05 from the first experiment, the probability of significance in the second experiment was 50 per cent. A more general form of this result is as follows. If the first trial yields p=α then the probability that a second trial will be significant at significance level α (and in the same direction as the first trial) is 0.5. Continue reading →

Categories: 4 years ago!, reproducibility, S. Senn, Statistics | Tags: Evidence-based medicine, p-value vs posterior, significance tests, Stephen Senn | 3 Comments

My Slides: “The Statistical Replication Crisis: Paradoxes and Scapegoats”

Posted on May 10, 2016 by Mayo

Below are the slides from my Popper talk at the LSE today (up to slide 70): (post any questions in the comments)

Categories: P-values, replication research, reproducibility, Statistics | 11 Comments

Some bloglinks for my LSE talk tomorrow: “The Statistical Replication Crisis: Paradoxes and Scapegoats”

Posted on May 9, 2016 by Mayo

In my Popper talk ~~tomorrow~~ today (in London), I will discuss topics in philosophy of statistics in relation to: the 2016 ASA document on P-values, and recent replication research in psychology. For readers interested in links from this blog, see:

I. My commentary on the ASA document on P-values (with links to the ASA document):

“Don’t Throw Out the Error Control Baby with the Bad Statistics Bathwater”

“A Small P-value Indicates that the Results are Due to Chance Alone: Fallacious or not: More on the ASA P-value Doc”

“P-Value Madness: A Puzzle About the Latest Test Ban, or ‘Don’t Ask, Don’t Tell’”

II. Posts on replication research in psychology: Continue reading →

Categories: Metablog, P-values, replication research, reproducibility, Statistics | 7 Comments

My Popper Talk at LSE: The Statistical Replication Crisis: Paradoxes and Scapegoats

Posted on May 4, 2016 by Mayo

I’m giving a Popper talk at the London School of Economics next Tuesday (10 May). If you’re in the neighborhood, I hope you’ll stop by.

A somewhat accurate blurb is here. I say “somewhat” because it doesn’t mention that I’ll talk a bit about the replication crisis in psychology, and the issues that crop up (or ought to) in connecting statistical results and the causal claim of interest.

Categories: Announcement | 6 Comments

Philosophy & Physical Computing Graduate Workshop at VT

Posted on May 2, 2016 by Mayo

A Graduate Summer Workshop at Virginia Tech (Poster)

Application deadline: May 8, 2016

PHILOSOPHY & PHYSICAL COMPUTING
JULY 11-24, 2016 at Virginia Tech

Who should apply:

This workshop is open to graduate students in master’s or PhD programs in philosophy or the sciences, including computer science.

For additional information or to apply online, visit thinkandcode.vtlibraries.org, or contact Dr. Benjamin Jantzen at bjantzen@vt.edu

Categories: Announcement | Leave a comment

Monthly Archives: May 2016

3 YEARS AGO (MAY 2013): MEMORY LANE

Allan Birnbaum: Foundations of Probability and Statistics (27 May 1923 – 1 July 1976)

Frequentstein: What’s wrong with (1 – β)/α as a measure of evidence against the null? (ii)

Fallacies of Rejection, Nouvelle Cuisine, and assorted New Monsters

Excerpts from S. Senn’s Letter on “Replication, p-values and Evidence”

My Slides: “The Statistical Replication Crisis: Paradoxes and Scapegoats”

Some bloglinks for my LSE talk tomorrow: “The Statistical Replication Crisis: Paradoxes and Scapegoats”

My Popper Talk at LSE: The Statistical Replication Crisis: Paradoxes and Scapegoats

Philosophy & Physical Computing Graduate Workshop at VT

A Graduate Summer Workshop at Virginia Tech (Poster)

Application deadline: May 8, 2016

PHILOSOPHY & PHYSICAL COMPUTING
JULY 11-24, 2016 at Virginia Tech

The Statistics Wars & Their Casualties

Blog links (references)

Reviews of Statistical Inference as Severe Testing (SIST)

Interviews & Debates on PhilStat (2020)

Interviews on PhilStat (2019)

LSE PH500 Research Seminar (May 21-June 25, 2020): Controversies in Phil Stat

Summer Seminar 2019 (article)

Top Posts & Pages

Conferences & Workshops

RMM Special Topic

Mayo & Spanos, Error Statistics

Follow Blog via Email

My Websites

Recent Posts: PhilStatWars

The Statistics Wars and Their Casualties Videos & Slides from Sessions 1 & 2

THE STATISTICS WARS AND THEIR CASUALTIES VIDEOS & SLIDES FROM SESSIONS 3 & 4

Final session: The Statistics Wars and Their Casualties: 8 December, Session 4

SCHEDULE: The Statistics Wars and Their Casualties: 1 Dec & 8 Dec: Sessions 3 & 4

WORKSHOP

LOG IN/OUT

Archives

© Deborah G. Mayo, Error Statistics Philosophy, 2011-2018 All Rights Reserved.

Monthly Archives: May 2016

A Graduate Summer Workshop at Virginia Tech (Poster)

Application deadline: May 8, 2016

PHILOSOPHY & PHYSICAL COMPUTING JULY 11-24, 2016 at Virginia Tech

The Statistics Wars & Their Casualties

Blog links (references)

Reviews of Statistical Inference as Severe Testing (SIST)

Interviews & Debates on PhilStat (2020)

Interviews on PhilStat (2019)

LSE PH500 Research Seminar (May 21-June 25, 2020): Controversies in Phil Stat

Summer Seminar 2019 (article)

Top Posts & Pages

Conferences & Workshops

RMM Special Topic

Mayo & Spanos, Error Statistics

Follow Blog via Email

My Websites

Recent Posts: PhilStatWars

LOG IN/OUT

Archives

© Deborah G. Mayo, Error Statistics Philosophy, 2011-2018 All Rights Reserved.

PHILOSOPHY & PHYSICAL COMPUTING
JULY 11-24, 2016 at Virginia Tech