0/n Thank you to all who participated in 'The demon game'. I am taking a screenshot because once you know the whys, the poll loses all value (there is no more asymmetry of information). These 182 responses are 'The sample'.
1/n You may already have known about the thought experiment you just took part in, mainly because there are many different variants of it in the literature. This is the one that I have seen lately:
2/n This example is good because the results are clear-cut enough to show 2 typical sources of error. Poor experimental setups are the bane of our existence, and there are myriad ways they can go wrong.
3/n These mistakes are pretty common and easy to make for anyone who is not a professional at extracting information from people. There is nothing to be ashamed of if you make them. There is one important invariant.
4/n The first, and probably the most obvious to anyone, is "Framing". The 'evil demon' immediately triggers your innate mechanism of self-preservation and therefore pushes you to the safest option. You can see how different the results are when the demon is *fair*.
5/n One thing I can say is: this comment made my day. That is the best example you could get of how 'Framing' works.
6/n Now, this is a one-off; I cannot run it again. This raises the question of what the results would be if there were no demon and/or Russian Roulette at all.
7/n As a real-life example, it is pretty easy to botch a UX interview when the interviewer has no experience in such tests (been there, done that). For example, Bernoulli experiments (pass/no-pass) are extremely sensitive to framing. Even a single word can make a huge difference.
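A minimal sketch of how you could check whether two framings of the same pass/no-pass question really differ, using a standard two-proportion z-test. The function name and the counts in the usage note are hypothetical illustrations, not the results of this poll.

```python
import math


def two_proportion_z_test(pass_a, n_a, pass_b, n_b):
    """Two-sided z-test for a difference between two pass rates.

    pass_a / n_a: passes and sample size under framing A (hypothetical inputs).
    pass_b / n_b: passes and sample size under framing B (hypothetical inputs).
    Returns (z, p_value). Relies on the normal approximation, so it is
    only reasonable when both samples are not tiny.
    """
    p_a = pass_a / n_a
    p_b = pass_b / n_b
    # Pooled pass rate under the null hypothesis "framing does not matter".
    pooled = (pass_a + pass_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        raise ValueError("pooled pass rate is 0 or 1; the test is undefined")
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value
```

For instance, `two_proportion_z_test(70, 100, 50, 100)` compares a made-up 70% pass rate under one wording against a made-up 50% under another; a small p-value would suggest the wording itself moved the answers.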
8/n A second common error is the interviewee's imperfect information or, in this case, concealed information. In the example, the interviewer knows pretty well what the risk is, but doesn't give you any contrasting information to assess the true risk. bandolier.org.uk/booth/Risk/dyi…
9/n This mechanic is useful when you want to assess whether there is sequence dependence between decisions. In UX, if possible, you would usually run the same set of tasks in different orders so that you can control for sequence dependence.
10/n Why did I run the changed example? I provided the other side as proof that, as I expected, the outcome is pretty sensitive.
11/n It is also well known that assessing risk is something 'we' humans are very bad at. By introducing this information asymmetry (or in this case, concealing the true risk of living) we can inadvertently bias the results.
12/n And that is why even 'simple' thought experiments should be designed so that the controller/designer/interviewer does not bias the sample. Hope this crash course on experimental design pitfalls helps you in the future.
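Running the same tasks in different orders can be sketched with a simple cyclic Latin square. This is one common counterbalancing scheme, not necessarily the one any given UX team uses; the function names, task labels, and participant IDs are hypothetical.

```python
def latin_square_orders(tasks):
    """Return len(tasks) orderings in which each task appears exactly
    once in every position (a cyclic Latin square)."""
    n = len(tasks)
    return [[tasks[(row + col) % n] for col in range(n)] for row in range(n)]


def assign_orders(participants, tasks):
    """Cycle participants through the orderings so that each ordering
    is used about equally often."""
    orders = latin_square_orders(tasks)
    return {p: orders[i % len(orders)] for i, p in enumerate(participants)}


if __name__ == "__main__":
    # Hypothetical task labels and participant IDs.
    plan = assign_orders(["p1", "p2", "p3", "p4"], ["A", "B", "C"])
    for participant, order in plan.items():
        print(participant, order)
```

A cyclic square balances the position of each task but not which task comes right before which, so if you suspect carry-over between specific pairs of tasks you would need a stricter design.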
1/ After almost 1.5 years of studying cancer research for personal reasons, I arrived at a realization that prompted me to write this tweet. I will lay out the hypothesis in this thread.
2/ Disclaimer: I am not a formally trained health researcher. More like a very curious and tenacious guy with a 15+ year background in research, development, & reproducibility in computer science.
3/ I am putting the hypothesis out there because it may make sense to others doing field work. Feel free to dissect this hypothesis, find holes in it, and play devil's advocate. We will all come out smarter from it.
1/ There is a very perverse dynamic in how Chavism (aka "the communist socialism") works. Let's use Argentina as the example. Over the first 20 years they initiate a process that we could call "Earnings Substitution" that will seal your fate over time.
2/ Your earnings/salary go down and at the same time "subsidies" start to go up in order to fool people into thinking that nothing has changed. This works because the dirty job is done by inflation, which is a much slower process.
3/ By the time people start to realize that something is wrong, because some critical goods are not available (medicine, food, you name it) or inflation enters a death spiral, most people already depend on subsidies for spending.
1/ Recently some interesting papers have been doing the rounds in the health community. To me the most interesting ones have been the GlyNAC paper and the more recent Taurine deficiency as a driver of aging papers.
2/ Disclaimer: While I have been researching this for a year and even executed an experimental protocol tailored for myself based on the GlyNAC paper, I am NOT a health professional, and I am just taking my health into my own hands. This is not advice of any kind.
3/ Disclaimers aside, why do I think these 2 papers are interesting? First because the claim (if true) is a game changer. And second because they may be related but I haven’t seen this relationship spotlighted by anyone.
This just confirmed the weaponization of block lists. If enough people/bots block and mute you, they are essentially cancelling you. I find lots of people I have never interacted with who have me blocked. I assume there are third-party block lists and block networks.
Normally that is an issue in general. Anyone who has done reinforcement learning has figured out (usually in the worst way) that you have to be incredibly cautious with penalties. They are very prone to being gamed.
2/ The general problem that practitioners run into (in the worst way) is always training set tainting (guilty as charged). Habits die hard, so the first thing I did was ask for a review of the paper without providing any extra knowledge about what the paper says
3/ From the response alone I learned 2 things. First, our paper title was dead accurate. I also learned that the response has no information whatsoever in it, as the entire response can be generated from understanding the title itself.
2/ Since I am doing it by hand, I started with a very simple prompt.
3/ I have argued before that trying to constrain the model this way is actually harming it. This is one of those cases. The good thing is that, at least, you can just add "Use the tokens" at the end of the request when it refuses and it will do it properly