akramer's comments

akramer · on May 27, 2023

The book you're thinking of is called "the gliding flight" and is my favorite paper airplane book.

https://www.amazon.com/Gliding-Flight-Paper-Make-Original-Ai...

akramer · on Jan 20, 2023

Yes, that would be negative correlation.

akramer · on Dec 20, 2022

Knipex Pliers Wrench is what you're looking for. They end up squeezing hard, unlike a crescent wrench, and do not round off fasteners as easily.

akramer · on Sept 22, 2022

The GCE API can be idempotent if you'd like. Fill out the requestId field with the same UUID in multiple instances.insert calls (or other mutation calls) and you will receive the same operation Id back in response.

Disclaimer: I work on GCE.

jsolson · on Sept 22, 2022

Today I learned! I'll admit I didn't know this functionality existed, and I've instead had used instances.insert following by querying the VM resource.

This is nicer!

akramer · on May 8, 2022

This is an interesting claim, because "being lost by your family at age 2 and essentially kidnapped and shipped to another country" ranks pretty high on my life impacting events list.

akramer · on Sept 10, 2021

The graph shows that those low p-values are more likely to be in papers, not that they’re more likely to occur. Is that suspicious? I don’t know enough about it to judge.

setgree · on Sept 10, 2021

> The graph shows that those low p-values are more likely to be in papers

This is an important distinction, in my experience [0].

Many papers will report a p-value only if it is below a significance threshold, otherwise they will report "n.s." (no statistic) or will give a range (e.g. p > .1). This just means that in addition to pressure to shelve insignificant results, publication bias also manifests as a tendency to emphasize and carefully report significant findings, while mentioning in passing those that don't meet whatever threshold.

[0] I happen to be working on a meta-analysis of psychology and public health papers at the moment. One paper that we're reviewing constructs 32 separate statistical models, reports that many of the results are not significant, and then discusses the significant results at length.

thaumasiotes · on Sept 10, 2021

> Many papers will report a p-value only if it is below a significance threshold, otherwise they will report "n.s." (no statistic) or will give a range (e.g. p > .1).

But the oddity here is a pronounced trend in the reported p-values that meet the significance threshold. The behavior you mention cannot create that trend.

thaumasiotes · on Sept 10, 2021

> The graph shows that those low p-values are more likely to be in papers, not that they’re more likely to occur.

It looks to me like the y-axis is measured in number of papers. The lower a p-value is, the more papers there are that happened to find a result beating the p-value.

So low p-values are more likely to occur a priori than high p-values are. This is most certainly not true in general. We might guess that psychologists are fudging their p-values somehow, or that journals are much, much, much, much, much, much, much more likely to publish "chewing a stalk of grass makes you walk slower, p < 0.013" than they are to publish "chewing a stalk of grass makes you walk slower, p < 0.04".

I've emphasized the level of bias the journals would need to be showing -- over fine distinctions in a value that is most often treated as a binary yes or no -- because it is much easier to get p < 0.04 than it is to get p < 0.013.

disgruntledphd2 · on Sept 10, 2021

Conditional on being published, this is true. Hence studies of the file-drawer effect and what not.

More generally, scientists are incentivised to find novel findings (i.e. unexpectedly low p-values) or lose their job.

Given that, the plot doesn't surprise me at all (Also, people will normally not report a bunch of non-significant results, which is a similar but unrelated problem).

eru · on Sept 10, 2021

That's part of why pre-registration of studies is so important.

mannykannot · on Sept 10, 2021

> This is most certainly not true in general.

Are you saying that in other disciplines, the distribution of p-values in published papers does not follow this pattern?

disgruntledphd2 · on Sept 10, 2021

I think what they meant was that we would expect the distribution of p-values to be uniform, if we had access to every p-value ever calculated (or a random sample thereof).

Publishing introduces a systematic bias, because it's difficult to get published where p>0.05 (or whatever the disciplinary standard is).

thaumasiotes · on Sept 10, 2021

> Publishing introduces a systematic bias, because it's difficult to get published where p>0.05 (or whatever the disciplinary standard is).

That explains why the p-values above 0.05 are rare compared to values below 0.05. But it fails to explain why p-values above 0.02 are rare compared to values below 0.02.

mannykannot · on Sept 10, 2021

I agree with your point from your previous post, that lower p-values are harder to get than higher ones, at least if one is looking at all possible causal relationships, but there are at least two possible causes for the inversion seen in publishing. The first is a general preference for lower p-values on the part of publishers and their reviewers (by 'general' I mean not just at the 0.05 value); the second is that researchers do not randomly pick what to study - they use their expertise and existing knowledge to guide their investigations.

Is that enough to tip the curve the other way across the range of p-values? Well, something is, and I am open to alternative suggestions.

One other point: while the datum immediately below 0.05 would normally be considered an outlier, the fact that it is next to a discontinuity (actual or perceived) renders that call less clear. Personally, I suspect it is not an accidental outlier, but given that it does not produce much distortion in the overall trend, I am less inclined to see the 0.05 threshold (actual or perceived) as a problem than I did before I saw this chart.

thaumasiotes · on Sept 10, 2021

> Personally, I suspect it is not an accidental outlier, but given that it does not produce much distortion in the overall trend, I am less inclined to see the 0.05 threshold (actual or perceived) as a problem than I did before I saw this chart.

Don't be fooled by the line someone drew on the chart. There's no particular reason to view this as a smooth nonlinear relationship except that somebody clearly wanted you to do that when they prepared the chart.

I could describe the same data, with different graphical aids, as:

- uniform distribution ("75 papers") between an eyeballed p < .02 and p < .05

- large spike ("95 papers") at exactly p = 0.4999

- sharp decline between p < .05 and p < .06

- uniform distribution ("19 papers") from p < .06 to p < .10

- bizarre, elevated sawtooth distribution between p < .01 and p < .02

And if I describe it that way, the spike at .05 is having exactly the effect you'd expect, drawing papers away from their rightful place somewhere above .05. If the p-value chart were a histogram like all the others instead of a scatterplot with a misleading visual aid, it would look pretty similar to the other charts.

mannykannot · on Sept 11, 2021

Well, you could extend this mode of analysis to its conclusion, for each dataset, and describe each datum in the data by its difference from its predecessor and successor, but if you do, does that help? I took it as significant that you wrote "...but it's an outlier from what is otherwise a regular pattern that clearly shows that smaller p-values are more likely to occur than larger ones are" (my emphasis) and that is what I am responding to.

I think we are both, in our own ways, making the point that there is more going on here than the spike just below 0.05 - namely, the regular pattern that you identified in your original post. If we differ, it seems to be because I think it is explicable.

WRT p-values of 0.05: I almost, but did not, say that if you curve-fitted above and below 0.05 independently, there would be a gap between the two, and maybe even if you left out the value immediately below 0.05. No doubt that would also happen for other values, but I am guessing that this gap would peak at 0.05. If I have time in the near future, I may try it. If you do, and find that I am wrong, I will be happy to recant.

akramer · on April 22, 2021

There's a reply to the LKML from the researcher in question admitting that the new student is also working under him doing research. He claims it's not related, but it's not clear how much his word is worth now...

https://lore.kernel.org/lkml/YIBBt6ypFtT+i994@pendragon.idea...

akramer · on March 2, 2021

I spend entirely too much money at Best Buy, and I’ve only written 5 star reviews for products I like. The bad products get immediately returned for a full refund - one of the advantages of buying from a big brick and mortar store. I never think about products I returned, I would prefer not to unless I’m given some incentive to write a bad review. Now, if I were stuck with the item, I’d probably trash it online in their review section.

spaetzleesser · on March 2, 2021

I guess a useful metric would be if they posted return rates, warranty cases and so on.

aSockPuppeteer · on March 2, 2021

This is one of the bonuses of a brick and mortar store if you can get a commissioned associate to tell you.

Ex: I’m not going to say phonefriend makes a bad phone but they have the highest return rate and people will buy a much more expensive Panasonic when they leave for a second time.

vel0city · on March 2, 2021

Same kind of experience here. On top of that, I'm usually a pretty picky purchaser who does a decent bit of market research ahead of time. So I'm probably more likely to only buy that good electronic item from a shop like Best Buy that I've got a high likelihood of giving a 4-5 star review.

akramer · on April 13, 2019

It appears to evaluate ruby code provided by the client given specific strings being set in a cookie.

akramer · on March 7, 2019

Each time I've seen something about GNU parallel pop up I've been tempted to post, but I've never made an account until now.

I wrote a very different style of command parallelizer that I named lateral. It doesn't require constructing elaborate commandlines that define all of your work at once. You start a server, and separate invocations of 'lateral run' add your commands to a queue to run on the server, including their filedescriptors. It makes for easier parallelization of complex arguments.

Take a look if this sort of thing interests you, as I haven't seen anyone write one like this before. Its primary difference is the ease with which each separate command can output to its own log, and the lack of need to play games with shell quoting and positional arguments.

Check it out: https://github.com/akramer/lateral

the_it_girl · on March 8, 2019

I think it is good you finally made an account: How are people going to find your software if you do not tell them about it :)

Can you make a comparison between lateral and sem?

https://www.gnu.org/software/parallel/sem.html

arbie · on March 8, 2019

This looks neat! Much <3 for using Golang and YAML.

Can a single lateral server queue be used across multiple host machines? And in the other direction, can lateral launch and monitor processes that reside across multiple machines?