"Challenges" ispell pass

[Ultimately_Untrue_Thought.git] / content / drafts / challenges-to-yudkowskys-pronoun-reform-proposal.md
diff --git a/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md b/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md

index 6a2b99d..02b28e9 100644 (file)
--- a/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md
+++ b/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md
@@ -1,7 +1,7 @@
  Title: Challenges to Yudkowsky's Pronoun Reform Proposal
  Date: 2022-01-01 11:00
  Category: commentary
-Tags: Eliezer Yudkowsky
+Tags: Eliezer Yudkowsky, convention
  Status: draft
  
  > Go, Soul, the body's guest,  
@@ -13,7 +13,25 @@ Status: draft
  >
  > —["The Lie" by Walter Raleigh](https://www.lesswrong.com/posts/trb9HPWFk8Gy9MBdN/less-wrong-poetry-corner-walter-raleigh-s-the-lie)
  
-[TODO: summary points]
+### Summary
+
+ * In a February 2021 Facebook post, Eliezer Yudkowsky inveighs against English's system of singular third-person pronouns: as a matter of language design, English's lack of a gender-neutral singular third-person pronoun is a serious flaw: you shouldn't be required to commit to a stance on what sex someone is in order to say a grammatical sentence about her or him.
+
+ * This seems fine as a critique of the existing English language. However, Yudkowsky then goes on to proclaim, in connection with pronouns for transgender people, that "the simplest and best protocol is, '"He" refers to the set of people who have asked us to use "he", with a default for those-who-haven't-asked that goes by gamete size' and to say that this just _is_ the normative definition. Because it is _logically_ rude, not just socially rude, to try to bake any other more complicated and controversial definition _into the very language protocol we are using to communicate_."
+
+ * However, this allegedly "simplest and best" proposal fails to achieve its stated aim of avoiding baking controversial claims into the language grammar. **The _reason_ trans people want others to use their designated pronouns is _because_ they're trying to control their socially-perceived sex category** and English speakers interpret _she_ and _he_ as conveying sex-category information. Yudkowsky's proposed circular redefinition is functionally "hypocritical": **if it were _actually true_ that _he_ simply referred to those who take the pronoun _he_, then there would be no reason for trans people to care which pronoun people used for them.**
+
+ * **The "meaning" of language isn't some epiphenominal extraphysical fact that can be declared or ascertained separately from common usage.** The word "dog" means what it does _because_ English speakers use the word that way; if you wanted "dog" to mean something different, you'd need to change the way English speakers behave. Thus, **circularly redefining _he_ and _she_ as purportedly referring to pronoun preferences rather than sex doesn't work, if people are still in practice choosing pronouns on the basis of perceived sex.**
+
+ * **Given that _she_ and _he_ do in fact convey sex category information to English speakers, some speakers might perceive an interest in refusing demands to use pronouns in a way that contradicts their perception of what sex people are.** This does _not_ constitute a philosophical commitment that pronouns can be "lies" as such.
+
+ * In the comments of the Facebook post, Yudkowsky seemingly denies that pronouns convey sex category information to native English speakers, claiming, "I do not know what it feels like from the inside to feel like a pronoun is attached to something in your head much more firmly than 'doesn't look like an Oliver' is attached to something in your head." **This self-report is not plausible, as evidenced by previous writings by Yudkowsky that treat sex and pronouns as synonymous.**
+
+ * **I'm _not_ claiming that Yudkowsky should have a different pronoun usage policy.** I agree that misgendering all trans people "on principle" seems very wrong and unappealing. Rather, I'm claiming that [**policy debates should not appear one-sided**](https://www.lesswrong.com/posts/PeSzc9JTBxhaYRp9b/policy-debates-should-not-appear-one-sided): in order to be politically neutral in your analysis of why someone might choose one pronoun policy over another, you need to _acknowledge_ the costs and benefits of a policy to different parties. **It can simultaneously be the case that pressuring speakers to use pronouns at odds with their perceptions of sex is a cost to those speakers, _and_ that failing to exert such pressure is a cost to trans people.** It's possible and desirable to be honest about that cost–benefit analysis, while ultimately choosing a policy that favors some parties' interests over others.
+
+ * **People with gender dysphoria who are considering whether to transition need _factually accurate information_ about gender-transition interventions**: if you have the facts wrong, you might wrongly avoid an intervention that would have benefited you, or wrongly undergo an intervention that harms you. **This includes facts about how pronouns work in the existing English language.** If it were _actually true_ that the simplest and best convention is that _he_ refers to the set of people who have asked us to use _he_, then asking for new pronouns despite not physically passing as the corresponding sex wouldn't be costly. But in fact, it is costly. As someone with a history of gender problems, this is decision-relevant to me. Thus, Yudkowsky is harming a reference class of people that includes me by spreading disinformation about the costs of asking for new pronouns; **I'm better off because I don't trust Eliezer Yudkowsky to tell the truth.**
+
+<p class="flower-break">⁕ ⁕ ⁕</p>
  
  [In a February 2021 Facebook post, Eliezer Yudkowsky inveighs against English's system of singular third-person pronouns](https://www.facebook.com/yudkowsky/posts/10159421750419228). As a matter of clean language design, English's lack of a gender-neutral singular third-person pronoun is a serious flaw. The function of pronouns is to have a brief way to refer back to entities already mentioned: it's more concise to be able to say "Katherine put her book on its shelf" rather than "Katherine put Katherine's book on the book's shelf". But then why couple that grammatical function to sex-category membership? You shouldn't _need_ to take a stance on someone's sex in order to talk about [her or](/2020/Apr/the-reverse-murray-rule/) him putting a book on the shelf.
  
@@ -25,13 +43,13 @@ It doesn't have to be this way! If you were fortunate enough to be in the positi
  
  If you grew up speaking English, gendered pronouns feel "normal" while gendered [noun classes](https://en.wikipedia.org/wiki/Noun_class) in many other languages (where, _e.g._, in French, a dog, _le chien_, is "masculine", but potatoes, _la pommes de terre_, are "feminine") seem strange and unnecessary, but someone who grew up with neither would regard both as strange. If you spoke a language that didn't _already_ have gendered pronouns, you probably wouldn't be spontaneously eager to add them.
  
-All this seems fine as a critique of the existing English pronoun system! However, I argue that Yudkowsky's prescriptions for English speakers going forward goes badly wrong. First, Yudkowsky argues that it's bad for stances on complicated empirical issues to be baked into the language grammar itself: since people might disagree on who fits into the [empirical clusters](https://www.lesswrong.com/posts/WBw8dDkAWohFjWQSk/the-cluster-structure-of-thingspace) of "female" and "male", you don't want people to be forced to make a call on that just in order to be able to use a pronoun.
+All this seems fine as a critique of the existing English pronoun system! However, I argue that Yudkowsky's prescription for English speakers going forward goes badly wrong. First, Yudkowsky argues that it's bad for stances on complicated empirical issues to be part of the language grammar itself: since people might disagree on who fits into the [empirical clusters](https://www.lesswrong.com/posts/WBw8dDkAWohFjWQSk/the-cluster-structure-of-thingspace) of "female" and "male", you don't want speakers to be forced to make a call on that just in order to be able to use a pronoun.
  
  Fair enough. Sounds like an argument for universal singular _they_ (and eating the cost of increased collisions where it's ambiguous which subject an instance of _they_ would refer to): if you don't think pronouns should convey sex-category information, then don't use pronouns that convey sex-category information! But then, in an unexplained leap, Yudkowsky proclaims:
  
  > So it seems to me that the simplest and best protocol is, "'He' refers to the set of people who have asked us to use 'he', with a default for those-who-haven't-asked that goes by gamete size" and to say that this just _is_ the normative definition. Because it is _logically_ rude, not just socially rude, to try to bake any other more complicated and controversial definition _into the very language protocol we are using to communicate_.
  
-The problem with this is that [the alleged rationale for the proposal does not support the proposal](https://www.lesswrong.com/posts/i6fKszWY6gLZSX2Ey/fake-optimization-criteria). If your default pronoun for those-who-haven't-asked goes by perceived sex (which one presumes is what Yudkowsky means by "gamete size"—we almost never _observe_ people's gametes), then you're still baking sex-category information into the language protocol in the form of the default! Moreover, this is clearly an "intended" rather than an accidental effect of the proposal, in the sense that a policy that _actually_ avoided baking sex-category information into the language (like universal singular _they_, or name-initial- or hair-color-based pronouns) would not have the same appeal to many of those who support self-chosen pronouns: _why_ is it that some people would want to opt-out of the sex-based default?
+The problem with this is that [the alleged rationale for the proposal does not support the proposal](https://www.lesswrong.com/posts/i6fKszWY6gLZSX2Ey/fake-optimization-criteria). If your default pronoun for those-who-haven't-asked goes by perceived sex (which one presumes is what Yudkowsky means by "gamete size"—we almost never _observe_ people's gametes), then you're still baking sex-category information into the language protocol in the form of the default! Moreover, this is clearly an "intended" rather than an accidental effect of the proposal, in the sense that a policy that _actually_ avoided baking sex-category information into the language (like universal singular _they_, or name-initial- or hair-color-based pronouns) would not have the same appeal to those who support self-chosen pronouns: _why_ is it that some people would want to opt-out of the sex-based default?
  
  Well, it would seem that the motivating example—the causal–historical explanation for why we're having this conversation about pronoun reform in the first place—is that trans men (female-to-male transsexuals) prefer to be called _he_, and trans women (male-to-female transsexuals) prefer to be called _she_. (Transsexuals seem much more common than people who just have principled opinions about pronoun reform without any accompanying desire to change what sex other people perceive them as.)
  
@@ -69,19 +87,19 @@ In the [words of](https://twitter.com/pangmeli/status/1079097805250224130) [anot
  
  These authors are to be commended for making their view so clear and explicit: in order to not betray your trans friends (according to this view), you need to think of them as the gender that they say they are. Mere verbal pronoun compliance in the absence of underlying belief is insufficient and possibly treacherous.
  
-This point that pronoun changes are desired precisely _because_ of what they _do_ imply about sex categories in the existing English language is a pretty basic one, that one should think should scarcely need to be explained. And yet Yudkowsky steadfastly ignores the role of existing meanings in this debate, bizarrely writing as if we were defining a conlang from scratch:
+This point that pronoun changes are desired precisely _because_ of what they _do_ imply about sex categories in the existing English language is a pretty basic one, that one would think should scarcely need to be explained. And yet Yudkowsky steadfastly ignores the role of existing meanings in this debate, bizarrely writing as if we were defining a conlang from scratch:
  
  > It is Shenanigans to try to bake your stance on how clustered things are and how appropriate it is to discretely cluster them using various criteria, _into the pronoun system of a language and interpretation convention that you insist everybody use!_
  
-There are a couple of problems with this. First of all, the "that you insist everybody use" part is a pretty blatant [DARVO](https://en.wikipedia.org/wiki/DARVO) in the current political environment around Yudkowsky's social sphere. A lot of the opposition to self-chosen pronouns is about opposition to _compelled speech_: people who don't think some trans person's transition should "count"—however cruel or capricious that might be—don't want to be coerced into legitimizing it with the pronoun choices in their _own_ speech. That's different from insisting that _others_ use sex-based non-subject-preferred pronouns, which is not something I see much of outside of gender-critical ("TERF") forums. Characterizing the issue as being about "freedom of pronouns", [as Yudkowsky does in the comment section](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421833274228), elides the fact that freedom to specify how _other people_ talk about you is in direct conflict with the freedom of speech of speakers! No matter which side of the conflict one supports, it seems wrong to characterize the self-ID pronoun side as being "pro-freedom", as if there wasn't any "freedom" concerns on the other side.
+There are a couple of problems with this. First of all, the "that you insist everybody use" part is a pretty blatant [DARVO](https://en.wikipedia.org/wiki/DARVO) in the current political environment around Yudkowsky's social sphere. A lot of the opposition to self-chosen pronouns is about opposition to _compelled speech_: people who don't think some trans person's transition should "count"—however cruel or capricious that might be—don't want to be coerced into legitimizing it with the pronoun choices in their _own_ speech. That's different from insisting that _others_ use sex-based non-subject-preferred pronouns, which is not something I see much of outside of gender-critical ("TERF") forums. That is, in the world I see, the pronouns-by-self-identity faction is _overwhelmingly_ the one "insist[ing] everybody use" their preferred convention. Characterizing the issue as being about "freedom of pronouns", [as Yudkowsky does in the comment section](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421833274228), elides the fact that freedom to specify how _other people_ talk about you is in direct conflict with the freedom of speech of speakers! No matter which side of the conflict one supports, it seems wrong to characterize the self-ID pronoun side as being "pro-freedom", as if there wasn't any "freedom" concerns on the other side.
  
-If you _actually_ believed it was Shenanigans to bake a stance on how clustered things are into a pronoun system and insist that everyone else use it, then it should be _equally_ Shenanigans independently of whether the insisted-on clusters are those of sex or those of gender identity—if you're going to be consistent, you should condemn them _both_. And yet _somehow_, the people who insist on sex-based pronouns are the target of Yudkowsky's condescension, whereas the people who insist on gender-identity-based pronouns get both a free pass, _and_ endorsement of their preferred convention (albeit for a different stated reason)? The one-sidedness here is pretty shameless!
+If you _actually_ believed it was Shenanigans to bake a stance on how clustered things are into a pronoun system and insist that everyone else use it, then it should be _equally_ Shenanigans independently of whether the insisted-on clusters are those of sex or those of gender identity—if you're going to be consistent, you should condemn them _both_. And yet _somehow_, people who insist on sex-based pronouns are the target of Yudkowsky's condescension, whereas people who insist on gender-identity-based pronouns get both a free pass, _and_ endorsement of their preferred convention (albeit for a different stated reason)? The one-sidedness here is pretty shameless!
  
-Perhaps more importantly, however, in discussing how to reform English, we're not actually in the position of defining a language from scratch. Even if you think the [cultural evolution](/2020/Jan/book-review-the-origins-of-unfairness/) of English involved Shenanigans, it's not fair to attribute the Shenanigans to native speakers accurately describing their native language. Certainly, language can evolve; words can change meaning over time; if you can get the people in some community to start using language differently, then you have _ipso facto_ changed their language. But when we consider language as an information-processing system that we can reason about using our standard tools of probability and game theory, we see that in order to change the meaning associated with a word, you actually _do_ have to somehow get people to change their usage. You can _advocate_ for your new meaning and use it in your own speech, but you can't just _declare_ your preferred new meaning and claim that it applies to the language as actually spoken, without speakers actually changing their behavior. As a result, Yudkowsky's proposal "to say that this just _is_ the normative definition" doesn't work.
+Perhaps more important than the speaker-freedom _vs._ subject-freedom issue, however, is that in discussing how to reform English, we're not actually in the position of defining a language from scratch. Even if you think the [cultural evolution](/2020/Jan/book-review-the-origins-of-unfairness/) of English involved Shenanigans, it's not fair to attribute the Shenanigans to native speakers accurately describing their native language. Certainly, language can evolve; words can change meaning over time; if you can get the people in some community to start using language differently, then you have _ipso facto_ changed their language. But when we consider language as an information-processing system, we see that in order to change the meaning associated with a word, you actually _do_ have to somehow get people to change their usage. You can _advocate_ for your new meaning and use it in your own speech, but you can't just _declare_ your preferred new meaning and claim that it applies to the language as actually spoken, without speakers actually changing their behavior. As a result, Yudkowsky's proposal "to say that this just _is_ the normative definition" doesn't work.
  
  To be clear, when I say that the proposal doesn't work, I'm not even saying I disagree with it. I mean that it literally, _factually_ doesn't work! Let me explain.
  
-The "meaning" of language isn't some [epiphenominal](https://www.lesswrong.com/posts/fdEWWr8St59bXLbQr/zombies-zombies) extraphysical fact that can be declared or ascertained separately from common usage. We can only say that the English word "dog" means [these-and-such four-legged furry creatures](https://en.wikipedia.org/wiki/Dog), _because_ English speakers actually use the word that way. [The meaning "lives" in the systematic correspondence between things in the world and what communication signals are sent.](https://www.lesswrong.com/posts/4hLcbXaqudM9wSeor/philosophy-in-the-darkest-timeline-basics-of-the-evolution)
+The "meaning" of language isn't some [epiphenomenal](https://www.lesswrong.com/posts/fdEWWr8St59bXLbQr/zombies-zombies) extraphysical fact that can be declared or ascertained separately from common usage. We can only say that the English word "dog" means [these-and-such four-legged furry creatures](https://en.wikipedia.org/wiki/Dog), _because_ English speakers actually use the word that way. [The meaning "lives" in the systematic correspondence between things in the world and what communication signals are sent.](https://www.lesswrong.com/posts/4hLcbXaqudM9wSeor/philosophy-in-the-darkest-timeline-basics-of-the-evolution)
  
  There's nothing magical about the particular word/symbol/phoneme-sequence "dog", of course. In German, they say _Hund_; in Finnish, they say _koira_; in Korean, they say _개_. Germans and Finns and Koreans (and their dogs) seem to be getting along just as well as we Anglophones.
  
@@ -109,17 +127,17 @@ In an article titled ["Pronouns are Rohypnol"](https://fairplayforwomen.com/pron
  
  Kerr suggests that preferred pronouns have a similar effect, that "a conflict between what we see and know to be true, and what we are expected to say, affects us." As an exercise, she suggests (privately!) translating sentences about transgender people to use natal-sex-based pronouns.
  
-Unfortunately, I don't have a study with objective measurements on hand (let me know in the comments if you do!), but I think most native English speakers who try this exercise and introspect—especially using examples where the trans person exhibits features or behavior typical of their natal sex, with things like "she ejaculated" or "he gave birth" being the starkest examples—will agree with Kerr's assessment: "You can know perfectly the actual sex of a male person, and yet you will still react differently if someone calls them _she_ instead of _he_."
+Unfortunately, I don't have a study with objective measurements on hand, but I think most native English speakers who try this exercise and introspect—especially using examples where the trans person exhibits features or behavior typical of their natal sex, with things like "she ejaculated" or "he gave birth" being the starkest examples—will agree with Kerr's assessment: "You can know perfectly the actual sex of a male person, and yet you will still react differently if someone calls them _she_ instead of _he_."
  
-Let's relate this is Yudkowsky's specialty of artificial intelligence. In a post on ["Multimodal Neurons in Artificial Neural Networks"](https://openai.com/blog/multimodal-neurons/), Gabriel Goh _et al._ explore the capabilities and biases of the [CLIP](https://openai.com/blog/clip/) neural network trained on textual and image data.
+Let's relate this to Yudkowsky's specialty of artificial intelligence. In a post on ["Multimodal Neurons in Artificial Neural Networks"](https://openai.com/blog/multimodal-neurons/), Gabriel Goh _et al._ explore the capabilities and biases of the [CLIP](https://openai.com/blog/clip/) neural network trained on textual and image data.
  
-There are some striking parallels between CLIP's behavior, and phenomena observed in neuroscience. Neurons in the human brain have been observed to respond to the same concept represented in different modalities (_e.g._, [Quiroga _et al._](/papers/quiroga_et_al-invariant_visual_representation_by_single_neurons.pdf) observed a neuron in one patient that responded to photos and sketches of actress Halle Berry, as well as the text string "Halle Berry"), and so do CLIP neurons. Futhermore, CLIP is vulnerable to a Stroop-like effect where its image-classification capabilities can be fooled by "typographic attacks"—a dog with instances of the text "$$$" superimposed over it gets classified as a piggy bank, an apple with a handwritten sign saying "LIBRARY" gets classified as a library. The network knows perfectly what dogs and apples look like, and yet still reacts differently if adjacent text calls them something else.
+There are some striking parallels between CLIP's behavior, and phenomena observed in neuroscience. Neurons in the human brain have been observed to respond to the same concept represented in different modalities; for example, [Quiroga _et al._](/papers/quiroga_et_al-invariant_visual_representation_by_single_neurons.pdf) observed a neuron in one patient that responded to photos and sketches of actress Halle Berry, as well as the text string "Halle Berry". It turns out that CLIP neurons also exhibit this multi-modal responsiveness. Furthermore, CLIP is vulnerable to a Stroop-like effect where its image-classification capabilities can be fooled by "typographic attacks"—a dog with instances of the text "$$$" superimposed over it gets classified as a piggy bank, an apple with a handwritten sign saying "LIBRARY" gets classified as a library. The network knows perfectly what dogs and apples look like, and yet still reacts differently if adjacent text calls them something else.
  
  I conjecture that the appeal of subject-chosen pronouns lies _precisely_ in how they exert Stroop-like effects on speakers' and listeners' cognition. (Once again, if it were _actually true_ that _she_ and _he_ had no difference in meaning, _there would be no reason to care_.) [Pronoun badges](/2018/Oct/sticker-prices/) are, quite literally, a typographic attack against native English speakers' brains.
  
-Note, I mean this as a value-free description of how the convention _actually functions_ in the real world, [not a condemnation](https://www.lesswrong.com/posts/N9oKuQKuf7yvCCtfq/can-crimes-be-discussed-literally). One could consistently hold that these "attacks" are morally good. (Analagously, [supernormal stimuli](https://www.lesswrong.com/posts/Jq73GozjsuhdwMLEG/superstimuli-and-the-collapse-of-western-civilization) like chocolate or pornography are "attacks" against the brain's evolved nutrition and reproductive-opportunity detectors, but most people are fine with this, because our goals are not evolution's.)
+Note, I mean this as a value-free description of how the convention _actually functions_ in the real world, [not a condemnation](https://www.lesswrong.com/posts/N9oKuQKuf7yvCCtfq/can-crimes-be-discussed-literally). One could consistently hold that these "attacks" are morally good. (Analogously, [supernormal stimuli](https://www.lesswrong.com/posts/Jq73GozjsuhdwMLEG/superstimuli-and-the-collapse-of-western-civilization) like chocolate or pornography are "attacks" against the brain's evolved nutrition and reproductive-opportunity detectors, but most people are fine with this, because our goals are not evolution's.)
  
-Is susceptibility to Stroop-like effects an indication of bad mind design? I mean, probably! One would expect that an intelligently-designed agent (as contrasted to messy human brains coughed up [blind evolution](https://www.lesswrong.com/posts/jAToJHtg39AMTAuJo/evolutions-are-stupid-but-work-anyway) or [lucky](https://www.lesswrong.com/posts/dpzLqQQSs7XRacEfK/understanding-the-lottery-ticket-hypothesis) neural networks found by gradient descent) could easily bind and re-bind symbols on the fly, such that a sane AI from the future could use whatever pronouns without dredging up any inapplicable mental associations, and tell you the color of the text "<span style="color:blue;">red</span>" just as easily as "<span style="color:red;">red</span>". But it seems kind of idle to criticize humans for not having a capability (natural language fluency without Stroop-like effects) that we don't even know how to implement in a computer program.
+Is susceptibility to Stroop-like effects an indication of bad mind design? I mean, probably! One would expect that an intelligently-designed agent (as contrasted to messy human brains coughed up by [blind evolution](https://www.lesswrong.com/posts/jAToJHtg39AMTAuJo/evolutions-are-stupid-but-work-anyway) or [lucky](https://www.lesswrong.com/posts/dpzLqQQSs7XRacEfK/understanding-the-lottery-ticket-hypothesis) neural networks found by gradient descent) could easily bind and re-bind symbols on the fly, such that a sane AI from the future could use whatever pronouns without dredging up any inapplicable mental associations, and tell you the color of the text "<span style="color:blue;">red</span>" just as easily as "<span style="color:red;">red</span>". But it seems kind of idle to criticize humans for not having a capability (natural language fluency without Stroop-like effects) that we don't even know how to implement in a computer program.
  
  Back to Kerr's article—importantly, Kerr is _explicitly_ appealing to psychological effects of different pronoun conventions. She is absolutely _not_ claiming that the use of preferred pronouns is itself a "lie" about some testable proposition. She writes:
  
@@ -157,9 +175,15 @@ And the thing is, Eliezer Yudkowsky is a native English speaker born in 1979. As
  
  I would bet at very generous odds at some point in his four decades on Earth, Eliezer Yudkowsky has used _she_ or _he_ on the basis of perceived sex to refer to someone whose name he didn't know. Because _all native English speakers do this_. Moreover, we can say something about the [cognitive algorithm](https://www.lesswrong.com/posts/HcCpvYLoSFP4iAqSz/rationality-appreciating-cognitive-algorithms) underlying _how_ they do this: for example, [people can recognize sex from facial photos _alone_ (hair covered, males clean-shaven) at 96% accuracy](/papers/bruce_et_al-sex_discrimination_how_do_we_tell.pdf). In naturalistic settings where we can see and hear more [secondary sex characteristics](https://en.wikipedia.org/wiki/Secondary_sex_characteristic#In_humans) than just someone's face (build, height, breasts, [voice](/papers/puts_et_al-masculine_voices_signal_mens_threat_potential.pdf), [gait](https://sillyolme.wordpress.com/2010/09/24/all-the-wrong-moves/), _&c_.), accuracy would be even greater. It's not a mystery why people can get sex-based pronouns "right" the vast majority of the time without having to be told or remember specific people's pronouns.
  
-Conversely, I would also bet at very generous odds that in his four decades on Earth, Eliezer Yudkowsky has very rarely if ever assumed what someone's name is on the basis of their appearance without being told. Because _no native English speakers do this_ (seriously, rather than as a joke or a troll). If you doubt this, try to explain what algorithm you would use to infer that someone's name is "Oliver" based on how they look. What are the "secondary Oliver characteristics", specifically? People for whom it was _actually true_ that names map to appearances the way pronouns map to sex, should not have trouble answering this question!
+Conversely, I would also bet at very generous odds that in his four decades on Earth, Eliezer Yudkowsky has very rarely if ever assumed what someone's name is on the basis of their appearance without being told. Because _no native English speakers do this_ (seriously, rather than as a joke or a troll). Now, it's true that the "doesn't look like an Oliver" example _was_ introduced into the discussion by another commenter, [who recounts once having called someone Bill who had introduced himself as Oliver](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421986539228&reply_comment_id=10159422872574228):
+
+> It did feel a little weird calling him Oliver, but everyone present knew what I was doing was being a jerk and teenagers are horrible. The "feels like lying" principle seems like it lets me keep calling him Bill, now righteously. I just can't even really bring myself to play in that sandbox in good faith.
+
+But the "everyone present knew what I was doing was being a jerk" characterization seems to agree that the motivation was joking/trolling. _How_ did everyone present know? Because it's absurd to infer a _particular_ name from someone's appearance.
  
-[TODO: this is why I feel comfortable saying the commenter who introduced "Oliver" was trolling in her teenage memory]
+It's true that there are name–feature correlations that observers can pick up on. For example, a "Juan" is likely to be Latino, a "Gertrude" in the current year is [likely to be old](https://www.everything-birthday.com/name/f/Gertrude); a non-Hispanic white Juan or a young Gertrude may indeed be likely to provoke a "Doesn't look like an _X_" reaction (which may also be sensitive to even subtler features). But while probabilistic inferences from features to low _likelihood_ of a particular name are valid, an inference from features to a particular name is absolutely not, because the function of a name is to be an opaque "pointer" to a particular individual. A Latino family choosing a name for their male baby may be somewhat more likely to choose "Juan" rather than "Oliver" (or "Gertrude"), but they could just as easily choose "Luis" or "Miguel" or "Alejandro" for the very same child, and there's no plausible physical mechanism by which a horrible teenager thirty years later could tell the difference.
+
+Thus, I reject the commenter's claim that "feels like lying" intuitions about pronouns and sex would have let her "keep calling him Bill, now righteously". What algorithm you would use to infer that someone's name is "Bill" based on how he looks? What are the "secondary Oliver characteristics", specifically? People for whom it was _actually true_ that names map to appearances the way pronouns map to sex, should not have trouble answering this question!
  
  If there _were_ a substantial contingent of native English speakers who don't interpret pronouns as conveying sex category information, one would expect this to show up in our cultural corpus more often—and yet, I'm actually not aware of any notable examples of this. In contrast, it's very easy to find instances of speakers treating pronouns and sex as synonymous. As an arbitrarily chosen example, in [one episode](https://theamazingworldofgumball.fandom.com/wiki/The_Nest) of the animated series [_The Amazing World of Gumball_](https://tvtropes.org/pmwiki/pmwiki.php/WesternAnimation/TheAmazingWorldOfGumball) featuring the ravenous spawn of our protagonists' evil pet turtle, the anthropomorphic-rabbit [Bumbling Dad](https://tvtropes.org/pmwiki/pmwiki.php/Main/BumblingDad) character [says, "Who's to say this pregnant turtle is a _her_?" and everyone gives him a look](https://www.youtube.com/watch?v=5N2Msnrq7wU&t=14s).
  
@@ -167,9 +191,9 @@ The joke, you see, is that bunny-father is unthinkingly applying the stock quest
  
  _The Amazing World of Gumball_ is rated [TV-Y7](https://rating-system.fandom.com/wiki/TV-Y7) and the episode in question came out in 2016. This is not a particularly foreign or distant cultural context, nor one that is expected to tax the cognitive abilities of a seven-year-old child! Is ... is Yudkowsky claiming not to get the joke?
  
-Posed that way, one would imagine not—but if Yudkowsky _does_ get the joke, then I don't think he can simultaneously _honestly_ claim to "not know what it feels like from the inside to feel like a pronoun is attached to something in your head much more firmly than 'doesn't look like an Oliver' is attached to something in your head." In order to get the joke in real time, your brain has to quickly make a multi-step logical inference that depends on the idea that pronouns imply sex. (The turtle is a "her" [iff](https://en.wikipedia.org/wiki/If_and_only_if) female, not-female implies not-pregnant, so if the turtle is pregnant, it must be a "her".) This would seem, pretty straightforwardly, to be a sense in which "a pronoun is attached to something in your head much more firmly than 'doesn't look like an Oliver' is attached to something in your head." I'm really not sure how else I'm supposed to interpret those words!
+Posed that way, one would imagine not—but if Yudkowsky _does_ get the joke, then I don't think he can simultaneously _honestly_ claim to "not know what it feels like from the inside to feel like a pronoun is attached to something in your head much more firmly than 'doesn't look like an Oliver' is attached to something in your head." In order to get the joke in real time, your brain has to quickly make a multi-step logical inference that depends on the idea that pronouns imply sex. (The turtle is a "her" [iff](https://en.wikipedia.org/wiki/If_and_only_if) female, not-female implies not-pregnant, so if the turtle is pregnant, it must be a "her".) This would seem, pretty straightforwardly, to be a sense in which "a pronoun is attached to something in your head much more firmly than 'doesn't look like an Oliver' is attached to something in your head." How else am I supposed to interpret those words?
  
-Perhaps it's not justified to question Yudkowsky's "I do not know what it feels like [...]" self-report based on generalizations about English speakers in general? Maybe his mind works differently, but dint of unusual neurodiversity or training in LambdaMOO? But if so, one would perhaps expect some evidence of this in his publicly observable writing? And yet, on the contrary, looking over his works, we can see instances of Yudkowsky treating pronouns as synonymous with sex/gender (just as one would expect a native English speaker born in 1979 to do), contrary to his 2021 self-report of not knowing what this feels like from the inside.
+Perhaps it's not justified to question Yudkowsky's "I do not know what it feels like [...]" self-report based on generalizations about English speakers in general? Maybe his mind works differently, but dint of unusual neurodiversity or training in LambdaMOO? But if so, one would perhaps expect some evidence of this in his publicly observable writing? And yet, on the contrary, looking over his works, we can see instances of Yudkowsky treating pronouns as synonymous with sex (just as one would expect a native English speaker born in 1979 to do), contrary to his 2021 self-report of not knowing what this feels like from the inside.
  
  For example, in Yudkowsky's 2001 _Creating Friendly AI: The Analysis and Design of Benevolent Goal Architectures_, the text "If a human really hates someone, she" is followed by [footnote 16](https://web.archive.org/web/20070615130139/http://singinst.org/upload/CFAI.html#foot-15): "I flip a coin to determine whether a given human is male or female." Note, "_is_ male or female", not "which pronoun to use." The text would seem to reflect the common understanding that _she_ and _he_ do imply sex specifically (and not some other thing, like being named Oliver), even if flipping a coin (and drawing attention to having done so) reflects annoyance that English requires a choice.
  
@@ -179,7 +203,7 @@ A perhaps starker example comes in the comments to Yudkowsky's 2009 short story
  >
  > Sometimes a random number generator only tells you what you already know.
  
-But the text of the story doesn't _say_ Aerhien isn't a "man"; it merely refers to her with she/her pronouns! If Yudkowsky "couldn't make [the character] a man", but the only unambiguous in-text consequence of this is that the chacter takes she/her pronouns, that would seem to be treating sex and pronouns as synonymous; the comment _only makes sense_ if Yudkowsky thinks the difference between _she_ and _he_ is semantically meaningful. (It's possible that he changed his mind about this between 2009 and 2021, but if so, you'd expect the 2021 Facebook discussion to explain why he changed his mind, rather than claiming that he "do[es] not know what it feels like from the inside" to hold the position implied by his 2009 comments.)
+But the text of the story doesn't _say_ Aerhien isn't a "man"; it merely refers to her with she/her pronouns! If Yudkowsky "couldn't make [the character] a man", but the only unambiguous in-text consequence of this is that the character takes she/her pronouns, that would seem to be treating sex and pronouns as synonymous; the comment _only makes sense_ if Yudkowsky thinks the difference between _she_ and _he_ is semantically meaningful. (It's possible that he changed his mind about this between 2009 and 2021, but if so, you'd expect the 2021 Facebook discussion to explain why he changed his mind, rather than claiming that he "do[es] not know what it feels like from the inside" to hold the position implied by his 2009 comments.)
  
  In the Facebook comments, Yudkowsky continues:
  
@@ -197,9 +221,9 @@ Personally, I have a _lot_ of sympathy for this, because in an earlier stage of
  
  But it's important to not use sympathy as an excuse to blur together different rationales, or obfuscate our analysis of the costs and benefits to different parties of different policies. "Systematically de-gender English because that's a superior language design" and "Don't misgender trans people because trans people are sympathetic" are _different_ political projects with different victory conditions: victory for the de-genderers would mean singular _they_ or similar for everyone (as a matter of language design, no idiosyncratic personal exceptions), which is very different from the [ask-and-share-pronouns norms](https://www.mypronouns.org/asking) championed by contemporary trans rights activists.
  
-Perhaps it might make sense for adherents of a "degender English" movement to stategically _ally_ with the trans rights movement: to latch on to gender-dysphoric people's pain as a political weapon to destabilize what the English-degenderers think of as a bad pronoun system for _other reasons_. Fine.
+Perhaps it might make sense for adherents of a "degender English" movement to strategically _ally_ with the trans rights movement: to latch on to gender-dysphoric people's pain as a political weapon to destabilize what the English-degenderers think of as a bad pronoun system for _other reasons_. Fine.
  
-But if that's the play you want to make, you forfeit the right to _honestly_ claim that your stance is that "feelings don't get to control everybody's language protocol". If you piously proclaim that the "important thing" is trans people's feelings of "not lik[ing] to be tossed into a Male Bucket or Female Bucket, as it would be assigned by their birth certificate", that would seem, pretty straightforwardly, to be participating in an attempt to let someone's feelings control everybody's language protocol! Again, I'm really not sure how else I'm supposed to interpret those words!
+But if that's the play you want to make, you forfeit the right to _honestly_ claim that your stance is that "feelings don't get to control everybody's language protocol". If you piously proclaim that the "important thing" is trans people's feelings of "not lik[ing] to be tossed into a Male Bucket or Female Bucket, as it would be assigned by their birth certificate", that would seem, pretty straightforwardly, to be participating in an attempt to make it so that "[someone's] feelings [...] get to control everybody's language protocol"! Again, how else am I supposed to interpret those words?
  
  There's nothing _inconsistent_ about believing that trans people's feelings matter, and that the feelings of people who resent the Stroop-like effect of having to speak in a way that contradicts their own sex-category perceptions, don't matter. (Or don't matter _as much_, quantitatively, under the utilitarian calculus.) But if that were your position, the intellectually honest thing to tell people like Barra Kerr is, "Sorry, I'm participating in a political coalition that believes that trans people's feelings are more important than yours with respect to this policy question; sucks to be you", rather than haughtily implying that people like Kerr are making an elementary philosophy mistake that they are _clearly not making_ if you _actually read what they write_.
  
@@ -207,7 +231,7 @@ There's nothing _inconsistent_ about believing that trans people's feelings matt
  
  All this having been said, Yudkowsky _is_ indeed correct to note that "when different people with firm attachments have _different_ firm attachments [...] we can't make them all be protocol". It's possible for observers to disagree about what sex category they see someone as belonging to, and it would be awkward at best for different speakers in a conversation to use different pronouns to refer to the same subject.
  
-As it happens, I think this _is_ an important consideration in favor of self-identity pronouns! [When different parties disagree about what category something should belong to, but want to coordinate to use the _same_ category, they tend to find some mutually-salient Schelling point to settle the matter.](https://www.lesswrong.com/posts/edEXi4SpkXfvaX42j/schelling-categories-and-simple-membership-tests) In the case of disagreements about a person's social sex category ("gender"), in the absence of a trusted central authority to break the symmetry among third parties' judgements (like a priest or rabbi in a tight-knit religious community, or a medical bureaucracy with the social power to diagnose who is "legitimately" transsexual), the most obvious Schelling point is to defer to the person themselves. I wrote about this argument in a previous post, ["Self-Identity Is a Schelling Point"](/2019/Oct/self-identity-is-a-schelling-point/).
+As it happens, I think this _is_ an important consideration in favor of self-identity pronouns! [When different parties disagree about what category something should belong to, but want to coordinate to use the _same_ category, they tend to find some mutually-salient Schelling point to settle the matter.](https://www.lesswrong.com/posts/edEXi4SpkXfvaX42j/schelling-categories-and-simple-membership-tests) In the case of disagreements about a person's social sex category ("gender"), in the absence of a trusted central authority to break the symmetry among third parties' judgments (like a priest or rabbi in a tight-knit religious community, or a medical bureaucracy with the social power to diagnose who is "legitimately" transsexual), the most obvious Schelling point is to defer to the person themselves. I wrote about this argument in a previous post, ["Self-Identity Is a Schelling Point"](/2019/Oct/self-identity-is-a-schelling-point/).
  
  But crucially, the fact that the self-identity convention is a Schelling point, _doesn't_ mean we have a [one-sided policy debate](https://www.lesswrong.com/posts/PeSzc9JTBxhaYRp9b/policy-debates-should-not-appear-one-sided) where it's in everyone's interests to support this "simplest and best protocol", with no downsides or trade-offs for anyone. The thing where _she_ and _he_ (which we don't know how to coordinate a jump away from) imply sex category inferences to actually-existing English speakers is still true! The Schelling point argument just means that the setup of the social-choice problem that we face happens to grant a structural advantage to those who favor the self-identity convention.
  
@@ -215,7 +239,7 @@ Although they're not the only ones with an structural advantage: a social order
  
  Still, I think most people reading this post _are_ "moderates" in this sense. Schelling points are powerful. If we're _not_ culturally-genocidal extremists who want to exclude transsexuals from Society (and therefore reject the "pronouns = sex, no exceptions" Schelling point), isn't it reasonable that we end up at the self-identity Schelling point—at least as far as the trivial courtesy of pronouns is concerned, even if some of the moderates want to bargain for the right to use natal-sex categories in some contexts?
  
-Sure. Yes. And indeed, I don't misgender people! (In public. Only rarely in private, when someone's transition doesn't seem legitimate or serious to me, or when talking to my politically reactionary friends.) I'm not arguing that Yudkowsky should misgender people! The purpose of this post is not to argue with Yudkowsky's pronoun usage, but rather to argue with the offered usage _rationale_ that "the simplest and best protocol is, '"He" refers to the set of people who have asked us to use "he", with a default for those-who-haven't-asked that goes by gamete size' and to say that this just _is_ the normative definition."
+Sure. Yes. And indeed, I don't misgender people! (In public. Only rarely in private, when someone's transition doesn't seem legitimate or serious to me, and the person I'm talking to doesn't seem liable to object.) I'm not arguing that Yudkowsky should misgender people! The purpose of this post is not to argue with Yudkowsky's pronoun usage, but rather to argue with the offered usage _rationale_ that "the simplest and best protocol is, '"He" refers to the set of people who have asked us to use "he", with a default for those-who-haven't-asked that goes by gamete size' and to say that this just _is_ the normative definition."
  
  As I have explained at length, this _rationale_ doesn't work and isn't true (even if better rationales, like sincere belief in gender identity, or the Schelling point argument, can end up recommending the same behavior). _No one_ actually believes (as contrasted to [believing that they believe](https://www.lesswrong.com/posts/CqyJzDZWvGhhFJ7dY/belief-in-belief)) that _she_ and _he_ aren't attached to gender in people's heads, despite Yudkowsky's sneering claim in the comments that he ["would not know how to write a different viewpoint as a sympathetic character."](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421986539228&reply_comment_id=10159423713134228)
  
@@ -239,7 +263,7 @@ A cis woman is testifying in court about a brutal rape that horrifically traumat
  
  "Oh. O–okay. And then she took her—" The victim breaks down crying. "I'm sorry, Your Honor; I can't do it. I'm under oath; I have to tell the story the way it happened to me. In my memories, the person who did those things to me was a man. A—"
  
-She hesistates, sobs a few more times. In this moment, almost more than the memories of the rape, she is very conscious of having never gone to college. The judge and the defense lawyer are smarter and more educated than her, and _they_ believe that the man who raped her is now (or perhaps, always had been) a woman. It had never made any sense to her—but how could she explain to an authority figure who she had no hope of out-arguing, even if she was even allowed to argue?
+She hesitates, sobs a few more times. In this moment, almost more than the memories of the rape, she is very conscious of having never gone to college. The judge and the defense lawyer are smarter and more educated than her, and _they_ believe that the man who raped her is now (or perhaps, always had been) a woman. It had never made any sense to her—but how could she explain to an authority figure who she had no hope of out-arguing, if she was even allowed to argue?
  
  "And by 'man', I mean—a male. The way I was raised, men—males—get called _he_ and _him_. If I say _she_, it doesn't feel true to the memory in my head. It—it feels like lying, Your Honor."
  
@@ -247,9 +271,9 @@ The judge scoffs. "You are _ontologically_ confused," he sneers. "At age 13 I wa
  
  "O-okay," says the victim. She doesn't know what _ontologically_ means, or what a LambdaMOO is. "So then—then sh-she took her erect penis and she—"
  
-She breaks down crying again. "Your Honor, I can't! I can't do it! It's not true! It's not—" She senses that the judge will imply she's stupid for saying it's not true. She gropes for some way of explaining. "I mean—the Court allows people to testify in Spanish or Chinese with the help of a translator, right? Can't you treat my testimony like that? Let me say what happened to me in the words that seems true to me, even if the court does its business using words in a different way?"
+She breaks down crying again. "Your Honor, I can't! I can't do it! It's not true! It's not—" She senses that the judge will imply she's stupid for saying it's not true. She gropes for some way of explaining. "I mean—the Court allows people to testify in Spanish or Chinese with the help of a translator, right? Can't you treat my testimony like that? Let me say what happened to me in the words that seem true to me, even if the court does its business using words in a different way?"
  
-"You're in contempt," says the judge. "Baliff! Take her away!"
+"You're in contempt," says the judge. "Bailiff! Take her away!"
  
  <p class="flower-break">⁕ ⁕ ⁕</p>
  
@@ -257,21 +281,17 @@ Not a sympathetic character? Not even a little bit?
  
  I suspect some readers will have an intuition that my choice of scenario is loaded, unfair, or unrealistic. To be sure, I chose it an unusually clear-cut case for why someone might have a need to use pronouns to imply sex in their _own_ speech. (If the scenario was just talking about someone borrowing a vacuum cleaner, fewer readers would have any sympathy for someone not wanting to concede the trivial courtesy of preferred pronouns.)
  
-But what, specifically, is unrealistic about it? Is it the idea that a trans woman could have raped someone before transitioning?
-
-[TODO: cite that this is real]
+But what, specifically, is unrealistic about it? Is it the idea that a trans woman could have raped someone before transitioning? Of course _most_ trans women are not sex offenders—just as _most_ non-transsexual males are not sex offenders—but instances of trans women committing the kinds of sex crimes that are overwhelmingly the provenance of men [are a documented thing](https://fairplayforwomen.com/transgender-male-criminality-sex-offences/).
  
-Is it the idea that the legal system would penalize someone for pronoun non-compliance?
-
-[TODO: cite examples of this happening; as liberal intellectuals, we want to debate the optimal communication policy and expect to govern by assent; we're not so bloodthirsty as to want to throw dissenters in jail—but that is potentially what's at stake; the judge actually does have a forced choice between sustained/overruled]
+Is it the idea that the legal system would penalize someone for pronoun non-compliance? But this is also an occasionally documented thing, as in [one case where a Canadian father was jailed](https://www.city-journal.org/canadian-father-jailed-for-speaking-out-about-trans-identifying-child) for violating [a court order](https://www.bccourts.ca/jdb-txt/sc/19/06/2019BCSC0604.htm) not to refer to his natal-female child with she/her pronouns. As liberal intellectuals debating optimal communication policies, we usually hope to govern by consensus: we want people to use preferred pronouns _voluntarily_, rather than being forced. But maintaining a collective norm in the face of those who have their own reasons to object to it, does ultimately require some sort of threat. In the vignette above, given the defense lawyer's objection; the judge does face a forced choice to Sustain or Overrule, and that choice has consequences either way.
  
  In the comments, Yudkowsky continues:
  
  > This is _not_ the woke position. The woke position is that when you call somebody "she" because she requested "she", you're validating her gender preference. I may SEPARATELY be happy to validate somebody's gender preference by using the more complex language feature of NOUN PHRASES to construct an actual SENTENCE that refers to her ON PURPOSE as a "woman", but when it comes to PRONOUNS I am not even validating anyone.
  
-Right; it's not the woke position. It's an _incoherent_ position that's optimized to concede to the woke the behavior that they want for a _different stated reason_ in order to make the concession appear "neutral" and not "politically" motivated. She requested "she" _because_ acceding to the request validates her gender preference in the minds of all native English speakers who are listening, even if Eliezer Yudkowsky has some clever casuistry for why it magically doesn't mean that when _he_ says it.
+Right, it's not the woke position. It's an _incoherent_ position that's optimized to concede to the woke the behavior that they want for a _different stated reason_ in order to make the concession appear "neutral" and not "politically" motivated. She requested "she" _because_ acceding to the request validates her gender preference in the minds of all native English speakers who are listening, even if Eliezer Yudkowsky has some clever casuistry for why it magically doesn't mean that when _he_ says it.
  
-I'm _not_ saying that Yudkowsky should have a different pronoun policy. (I agree that misgendering all trans people "on principle" seems very wrong and unappealing.) Rather, I'm saying that in order to _actually_ be politically neutral in your analysis of _why_ someone might choose one pronoun policy over another, you need to _acknowledge_ the costs and benefits of a policy to different parties, and face the unhappy fact that sometimes there are cases where there _is_ no "neutral" policy, because all available policies impose costs on _someone_ and there's no solution that everyone is happy with. (Rational agents can hope to reach _some_ point on the Pareto frontier, but non-identical agents are necessarily going to fight about _which_ point, even if most of the fighting takes place in non-realized counterfactual possible worlds rather than exerting costs in reality.)
+Again, I'm _not_ saying that Yudkowsky should have a different pronoun policy. (I agree that misgendering all trans people "on principle" seems very wrong and unappealing.) Rather, I'm saying that in order to _actually_ be politically neutral in your analysis of _why_ someone might choose one pronoun policy over another, you need to _acknowledge_ the costs and benefits of a policy to different parties, and face the unhappy fact that sometimes there are cases where there _is_ no "neutral" policy, because all available policies impose costs on _someone_ and there's no solution that everyone is happy with. (Rational agents can hope to reach _some_ point on the Pareto frontier, but non-identical agents are necessarily going to fight about _which_ point, even if most of the fighting takes place in non-realized counterfactual possible worlds rather than exerting costs in reality.)
  
  Policy debates should not appear one-sided. Exerting social pressure on (for example) a native-English-speaking rape victim to refer to her male rapist with _she_/_her_ pronouns is a _cost_ to her. And, simultaneously, _not_ exerting that pressure is a _cost_ to many trans people, by making recognition of their social gender _conditional_ on some standard of good behavior, rather than an unconditional fact that doesn't need to be "earned" or justified in any way.
  
@@ -279,7 +299,7 @@ You might think the cost of making the rape victim say _she_ is worth it, becaus
  
  Fine. That's a perfectly coherent position. But if that's your position and you care about being intellectually honest, you need to _acknowledge_ that your position exerts costs on some actually-existing English speakers who have a use-case for using pronouns to imply sex. You need to be able to look that rape victim in the eye and say, "Sorry, I'm participating in a political coalition that believes that trans people's feelings are more important than yours with respect to this policy question; sucks to be you."
  
-And of course—it _should_ be needless to say—this applies symmetrically. If you think speakers _should_ be able to misgender according to their judgement and you care about being intellectually honest, you need to be able to look a trans person in the eye and say, "Sorry, I'm participating in a political coalition that believes the freedom of speech of speakers is more important than your gender being recognized; sucks to be you."
+And of course—it _should_ be needless to say—this applies symmetrically. If you think speakers _should_ be able to misgender according to their judgment and you care about being intellectually honest, you need to be able to look a trans person in the eye and say, "Sorry, I'm participating in a political coalition that believes the freedom of speech of speakers is more important than your gender being recognized; sucks to be you."
  
  Or if you have more important things to worry about (like the fate of a hundred thousand galaxies depending on the exact preferences built into the first artificial superintelligence) and don't want the distraction of taking a position on controversial contemporary social issues, fine: use whatever pronoun convention happens to be dominant in your local social environment, and, if questioned, say, "I'm using the pronoun convention that happens to be dominant in my local social environment." You don't have to invent _absurd lies_ to make it look like the convention that happens to be dominant in your local social environment has no costs.
  
@@ -297,251 +317,20 @@ I guess for me, the issue is that this is a question where _I need the correct r
  
  This debate looks very different depending on whether you're coming into it as someone being _told_ that you need to change your pronoun usage for the sake of someone who will be very hurt if you don't—or whether you're in the position of wondering whether it makes sense to _make_ such a request of others.
  
-As a good cis ally, you're told that trans people know who they are and you need to respect that [on pain of being responsible for someone's suicide](/2018/Jan/dont-negotiate-with-terrorist-memeplexes/). While politically convenient for people who have _already_ transitioned and don't want anyone second-guessing their identity, I think this view is actually false. Humans don't have an atomic "gender identity" that they just _know_, which has no particular properties other than it not being recognized by others being worse than death. Rather, there are a variety of reasons why someone might feel sad about being the sex that they are, and wish they could be the other sex instead, which is called "gender dysphoria."
+As a good cis ally, you're told that trans people know who they are and you need to respect that [on pain of being responsible for someone's suicide](/2018/Jan/dont-negotiate-with-terrorist-memeplexes/). While politically convenient for people who have _already_ transitioned and don't want anyone second-guessing their identity, I think this view is actually false. Humans don't have an atomic "gender identity" that they just _know_, which has no particular properties other than it being worse than death for it to not be recognized by others. Rather, there are a variety of reasons why someone might feel sad about being the sex that they are, and wish they could be the other sex instead, which is called "gender dysphoria."
  
  Fortunately, our Society has interventions available to approximate changing sex as best we can with existing technology: you can get hormone replacement therapy (HRT), genital surgery, ask people to call you by a different name, ask people to refer to you with different pronouns, get new clothes, get other relevant cosmetic surgeries, _&c._ In principle, it's possible to pick and choose some of these interventions piecemeal—[I actually tried just HRT for five months in 2017](http://unremediatedgender.space/tag/hrt-diary/)—but it's more common for people to "transition", to undergo a correlated _bundle_ of these interventions to approximate a sex change.
  
  On this view, there's not a pre-existing fact of the matter as to whether someone "is trans" as an atomic identity. Rather, gender-dysphoric people have [the option to _become_ trans](https://thingofthings.wordpress.com/2016/04/11/1327/) by means of undergoing the bundle of interventions that constitute transitioning, if they think it will make their life better. But in order for a gender-dysphoric person to _decide_ whether transitioning is a good idea with benefits that exceed the costs, they need _factually accurate information_ about the nature of their dysphoria and each of the component interventions.
  
-If people in a position of intellectual authority provide _inaccurate_ information about transitioning interventions, that's making the lives of gender-dysphoric people worse, because agents with less accurate information make worse decisions (in expectation): if you have the facts wrong, you might wrongly avoid an intervention that would have benefitted you, or wrongly undergo an intevention that harms you.
+If people in a position of intellectual authority provide _inaccurate_ information about transitioning interventions, that's making the lives of gender-dysphoric people worse, because agents with less accurate information make worse decisions (in expectation): if you have the facts wrong, you might wrongly avoid an intervention that would have benefited you, or wrongly undergo an intervention that harms you.
  
-For example, I think my five-month HRT experiment was a _good_ decision—I benefitted from the experience and I'm very glad I did it, even though I didn't end up staying on HRT long term. The benefits (satisfied curiosity about the experience, breast tissue) exceeded the costs (a small insurance co-pay, sitting through some gatekeeping sessions, the inconvenience of [wearing a patch](/2017/Jan/hormones-day-33/) or [taking a pill](/2017/Jul/whats-my-motivation-or-hormones-day-89/), [various slight medical risks including to future fertility](https://srconstantin.github.io/2016/10/06/cross-sex-hormone-therapy.html)).
+For example, I think my five-month HRT experiment was a _good_ decision—I benefited from the experience and I'm very glad I did it, even though I didn't end up staying on HRT long term. The benefits (satisfied curiosity about the experience, breast tissue) exceeded the costs (a small insurance co-pay, sitting through some gatekeeping sessions, the inconvenience of [wearing a patch](/2017/Jan/hormones-day-33/) or [taking a pill](/2017/Jul/whats-my-motivation-or-hormones-day-89/), [various slight medical risks including to future fertility](https://srconstantin.github.io/2016/10/06/cross-sex-hormone-therapy.html)).
  
-If someone I trusted as an intellectual authority had falsely told me that HRT makes you go blind and lose the ability to hear music, _and I were dumb enough to believe them_, then I wouldn't have done it, and I would have missed out on something that benefitted me. Such an authority figure would be harming me by means of giving me bad information; I'd be better off if I hadn't trusted them to tell me the truth.
+If someone I trusted as an intellectual authority had falsely told me that HRT makes you go blind and lose the ability to hear music, _and I were dumb enough to believe them_, then I wouldn't have done it, and I would have missed out on something that benefited me. Such an authority figure would be harming me by means of giving me bad information; I'd be better off if I hadn't trusted them to tell the truth.
  
  In contrast, I think asking everyone in my life to use she/her pronouns for me would be an _obviously incredibly bad decision_. Because—notwithstanding my clean-shavenness and beautiful–beautiful ponytail and slight gynecomastia from that HRT experiment five years ago—anyone who looks at me can see at a glance that I'm male (as a _fact_ about the real world, however I feel about it). People would comply because they felt obligated to (and apologize profusely when they slipped up), but it wouldn't come naturally, and strangers would always get it wrong without being told—_in accordance with_ the "default for those-who-haven't-asked that goes by gamete size" clause of Yudkowsky's reform proposal, but really because pronouns are firmly attached to sex in their heads. The costs (this tremendous awkwardness and fakeness suffusing _all future social interactions involving me_) would exceed the benefits (I actually do feel happier about the word _she_).
  
-I used to trust Yudkowsky as an intellectual authority; his [Sequences](https://www.readthesequences.com/) from the late 'aughts were so life-alteringly great that I built up a trust that if Eliezer Yudkowsky said something, that thing was probably so, even if I didn't immediately understand why. But these days, Yudkowsky is telling me that 'she' normatively refers to the set of people who have asked us to use 'she', and that those who disagree are engaging in logically rude Shenanigans. However, as I have just explained at length, this is bullshit. (Declaring a "normative" meaning on your Facebook wall doesn't rewrite the _actual_ meaning embodied the brains of 370 million English speakers.) If I were _dumb enough to believe him_, I might ask people for new pronouns, which would obviously be an incredibly bad decision. Yudkowsky is harming a reference class of people that includes more naïve versions of me by giving them bad information; I'm better off because I don't trust Eliezer Yudkowsky to tell me the truth.
+I used to trust Yudkowsky as an intellectual authority; his [Sequences](https://www.readthesequences.com/) from the late 'aughts were so life-alteringly great that I built up a trust that if Eliezer Yudkowsky said something, that thing was probably so, even if I didn't immediately understand why. But these days, Yudkowsky is telling me that 'she' normatively refers to the set of people who have asked us to use 'she', and that those who disagree are engaging in logically rude Shenanigans. However, as I have just explained at length, this is bullshit. (Declaring a "normative" meaning on your Facebook wall doesn't rewrite the _actual_ meaning encoded in the brains of 370 million English speakers.) If I were _dumb enough to believe him_, I might ask people for new pronouns, which would obviously be an incredibly bad decision. (It might be a _less_ bad decision if done in conjunction with a serious gender transition effort, but Yudkowsky's pronoun reform proposal doesn't _say_ "she" is the pronoun for fully-transitioned trans women; it just says you have to ask.) Yudkowsky is harming a reference class of people that includes more naïve versions of me by giving them bad information; I'm better off because I don't trust Eliezer Yudkowsky to tell the truth.
  
  (I guess I [can't say I wasn't warned](https://www.lesswrong.com/posts/wustx45CPL5rZenuo/no-safe-defense-not-even-science).)
-
------
-
-If Yudkowsky is obviously playing dumb (consciously or not) and his comments can't be taken seriously, what's _actually_ going on here?
-
-When smart people act dumb, [it's usually wisest to assume that their behavior represents _optimized_ stupidity](https://www.lesswrong.com/posts/sXHQ9R5tahiaXEZhR/algorithmic-intent-a-hansonian-generalized-anti-zombie)—apparent "stupidity" that achieves a goal through some other channel than their words straightforwardly reflecting the truth. Someone who was _actually_ stupid wouldn't be able to generate text with a specific balance of insight and selective stupidity fine-tuned to reach a gender-politically convenient conclusion without explicitly invoking any controversial gender-political reasoning.
-
-Fortunately, Yudkowsky graciously grants us a clue in the form of [a disclaimer comment](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421833274228):
-
-> It unfortunately occurs to me that I must, in cases like these, disclaim that—to the extent there existed sensible opposing arguments against what I have just said—people might be reluctant to speak them in public, in the present social atmosphere. [...]
->
-> This is a filter affecting your evidence; it has not to my own knowledge filtered out a giant valid counterargument that invalidates this whole post. I would have kept silent in that case, for to speak then would have been dishonest.
->
-> Personally, I'm used to operating without the cognitive support of a civilization in controversial domains, and have some confidence in my own ability to independently invent everything important that would be on the other side of the filter and check it myself before speaking. So you know, from having read this, that I checked all the speakable and unspeakable arguments I had thought of, and concluded that this speakable argument would be good on net to publish, as would not be the case if I knew of a stronger but unspeakable counterargument in favor of Gendered Pronouns For Everyone and Asking To Leave The System Is Lying.
->
-> But the existence of a wide social filter like that should be kept in mind; to whatever quantitative extent you don't trust your ability plus my ability to think of valid counterarguments that might exist, as a Bayesian you should proportionally update in the direction of the unknown arguments you speculate might have been filtered out.
-
-So, the explanation of [the problem of political censorship filtering evidence](https://www.lesswrong.com/posts/DoPo4PDjgSySquHX8/heads-i-win-tails-never-heard-of-her-or-selective-reporting) here is great, but the part where Yudkowsky claims "confidence in [his] own ability to independently invent everything important that would be on the other side of the filter" is just _laughable_. My point that _she_ and _he_ have existing meanings that you can't just ignore by fiat given that the existing meanings are _exactly_ what motivate people to ask for new pronouns in the first place is _really obvious_.
-
-Really, it would be _less_ embarassing for Yudkowsky if he were outright lying about having tried to think of counterarguments. The original post isn't _that_ bad if you assume that Yudkowsky was writing off the cuff, that he clearly just _didn't put any effort whatsoever_ into thinking about why someone might disagree. If he _did_ put in the effort—enough that he felt comfortable bragging about his ability to see the other side of the argument—and _still_ ended up proclaiming his "simplest and best protocol" without even so much as _mentioning_ any of its incredibly obvious costs ... that's just _pathetic_. If Yudkowsky's ability to explore the space of arguments is _that_ bad, why would you trust his opinion about _anything_?
-
-But perhaps it's premature to judge Yudkowsky without appreciating what tight constraints he labors under. The disclaimer comment mentions "speakable and unspeakable arguments"—but what, exactly, is the boundary of the "speakable"? In response to a commenter mentioning the cost of having to remember pronouns as a potential counterargument, Yudkowsky [offers us another clue](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421833274228&reply_comment_id=10159421871809228):
-
-> People might be able to speak that. A clearer example of a forbidden counterargument would be something like e.g. imagine if there was a pair of experimental studies somehow proving that (a) everybody claiming to experience gender dysphoria was lying, and that (b) they then got more favorable treatment from the rest of society. We wouldn't be able to talk about that. No such study exists to the best of my own knowledge, and in this case we might well hear about it from the other side to whom this is the exact opposite of unspeakable; but that would be an example.
-
-(As an aside, the wording of "we might well hear about it from _the other side_" (emphasis mine) is _very_ interesting, suggesting that the so-called "rationalist" community, is, effectively, a partisan institution, despite its claims to be about advancing the generically human art of systematically correct reasoning.)
-
-I think (a) and (b) _as stated_ are clearly false, so "we" (who?) fortunately aren't losing much by allegedly not being able to speak them. But what about some _similar_ hypotheses, that might be similarly unspeakable for similar reasons?
-
-Instead of (a), consider the claim that (a′) self-reports about gender dysphoria are substantially distorted by [socially-desirable responding tendencies](https://en.wikipedia.org/wiki/Social-desirability_bias)—as a notable and common example, heterosexual males with [sexual fantasies about being female](http://www.annelawrence.com/autogynephilia_&_MtF_typology.html) [often falsely deny or minimize the erotic dimension of their desire to change sex](/papers/blanchard-clemmensen-steiner-social_desirability_response_set_and_systematic_distortion.pdf) (The idea that self-reports can be motivatedly inaccurate without the subject consciously "lying" should not be novel to someone who co-blogged with [Robin Hanson](https://en.wikipedia.org/wiki/The_Elephant_in_the_Brain) for years!)
-
-And instead of (b), consider the claim that (b′) transitioning is socially rewarded within particular _subcultures_ (although not Society as a whole), such that many of the same people wouldn't think of themselves as trans or even gender-dysphoric if they lived in a different subculture.
-
-I claim that (a′) and (b′) are _overwhelmingly likely to be true_. Can "we" talk about _that_? Are (a′) and (b′) "speakable", or not?
-
-We're unlikely to get clarification from Yudkowsky, but based on my experiences with the so-called "rationalist" community over the past coming-up-on-six years—the Whole Dumb Story of which might need to be the topic of _another_ future multi-thousand-word blog post, which I've found difficult to write, because it still hurts—I'm going to _guess_ that the answer is broadly No: no, "we" can't talk about that.
-
-But if I'm right that (a′) and (b′) should be live hypotheses and that Yudkowsky would consider them "unspeakable", that means "we" can't talk about what's _actually going on_ with gender dysphoria and transsexuality, which puts the whole discussion in a different light. In another comment, Yudkowsky lists some gender-transition interventions he named in [a November 2018 Twitter thread](https://twitter.com/ESYudkowsky/status/1067183500216811521) that was the precursor to the present discussion—using a different bathroom, changing one's name, asking for new pronouns, and getting sex reassignment surgery—and notes that none of these are calling oneself a "woman". [He continues](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421986539228&reply_comment_id=10159424960909228):
-
-> [Calling someone a "woman"] _is_ closer to the right sort of thing _ontologically_ to be true or false. More relevant to the current thread, now that we have a truth-bearing sentence, we can admit of the possibility of using our human superpower of language to _debate_ whether this sentence is indeed true or false, and have people express their nuanced opinions by uttering this sentence, or perhaps a more complicated sentence using a bunch of caveats, or maybe using the original sentence uncaveated to express their belief that this is a bad place for caveats. Policies about who uses what bathroom also have consequences and we can debate the goodness or badness (not truth or falsity) of those policies, and utter sentences to declare our nuanced or non-nuanced position before or after that debate.
->
-> Trying to pack all of that into the pronouns you'd have to use in step 1 is the wrong place to pack it.
-
-Sure, _if we were in the position of designing a constructed language from scratch_ under current social conditions in which a person's "gender" is a contested social construct, rather than their sex an objective and undisputed fact, then yeah: in that situation _which we are not in_, you definitely wouldn't want to pack sex or gender into pronouns. But it's a disingenuous derailing tactic to grandstand about how people need to alter the semantics of their _already existing_ native language so that we can discuss the real issues under an allegedly superior pronoun convention when, _by your own admission_, you have _no intention whatsoever of discussing the real issues!_
-
-(Lest the "by your own admission" clause seem too accusatory, I should note that given constant behavior, admitting it is _much_ better than not-admitting it; so, huge thanks to Yudkowsky for the transparency on this point!)
-
-Again, a comparison to the _tú_/_usted_ distinction is instructive. It's one thing to advocate for collapsing the distinction and just settling on one second-person singular pronoun for the Spanish language. That's principled.
-
-It's quite another thing altogether to _simultaneously_ try to prevent a speaker from using _tú_ to indicate disrespect towards a social superior (on the stated rationale that the _tú_/_usted_ distinction is dumb and shouldn't exist), while _also_ refusing to entertain or address the speaker's arguments explaining _why_ they think their interlocutor is unworthy of the deference that would be implied by _usted_ (because such arguments are "unspeakable" for political reasons). That's just psychologically abusive.
-
-If Yudkowsky _actually_ possessed (and felt motivated to use) the "ability to independently invent everything important that would be on the other side of the filter and check it [himself] before speaking", it would be _obvious_ to him that "Gendered Pronouns For Everyone and Asking To Leave The System Is Lying" isn't the hill anyone would care about dying on if it weren't a Schelling point. A lot of TERF-adjacent folk would be _overjoyed_ to concede the (boring, insubstantial) matter of pronouns as a trivial courtesy if it meant getting to _actually_ address their real concerns of "Biological Sex Actually Exists", and ["Biological Sex Cannot Be Changed With Existing or Foreseeable Technology"](https://www.lesswrong.com/posts/QZs4vkC7cbyjL9XA9/changing-emotions) and "Biological Sex Is Sometimes More Relevant Than Self-Declared Gender Identity." The reason so many of them are inclined to stand their ground and not even offer the trivial courtesy is because they suspect that the matter of pronouns is being used as a rhetorical wedge and typographical attack to try to prevent people from talking or thinking about sex.
-
-And this suspicion seems broadly accurate! _After_ having been challenged on it, Yudkowsky can try to spin his November 2018 Twitter comments as having been a non-partisan matter of language design ("Trying to pack all of that into the pronouns [...] is the wrong place to pack it"), but when you read the text that was actually published at the time, parts of it are hard to read as anything other than an attempt to intimidate and delegitimize people who want to use language to reason about sex rather than gender identity. [For example](https://twitter.com/ESYudkowsky/status/1067490362225156096):
-
-> The more technology advances, the further we can move people towards where they say they want to be in sexspace. Having said this we've said all the facts. Who competes in sports segregated around an Aristotelian binary is a policy question (that I personally find very humorous).
-
-Sure, _in the limit of arbitrarily advanced technology_, everyone could be exactly where they wanted to be in sexpsace. Having said this, we have _not_ said all the facts relevant to decisionmaking in our world, where _we do not have arbitrarily advanced technology_. As Yudkowsky [acknowledges in the previous Tweet](https://twitter.com/ESYudkowsky/status/1067488844122021888), "Hormone therapy changes some things and leaves others constant." The existence of HRT does not take us into the Glorious Transhumanist Future where everyone is the sex they say they are.
-
-Rather, previously sexspace had two main clusters (normal females and males) plus an assortment of tiny clusters corresponding to various [disorders of sex development](https://en.wikipedia.org/wiki/Disorders_of_sex_development), and now it has two additional tiny clusters: females-on-masculinizing-HRT and males-on-feminizing-HRT. Certainly, there are situations where you would want to use "gender" categories that use the grouping {females, males-on-feminizing-HRT} and {males, females-on-masculinizing-HRT}.
-
-But the _reason_ for having sex-segregated sports leagues is because the sport-relevant multivariate trait distributions of female bodies and male bodies are quite different.
-
-[TODO: (clean up and consolidate the case here after reading the TW-in-sports articles)
-
-The "multivariate" part is important, because
-
-Different traits have different relevance to different sports; the fact that it's apples-to-oranges is _why_ women do better in ultraswimming—that competition is sampling a corner of sportspace where body fat is an advantage
-
-It's not that females and males are exactly the same except males are 10% stronger on average
-
-It really is an apples-to-oranges comparison, rather than "two populations of apples with different mean weight"
-
-https://www.lesswrong.com/posts/cu7YY7WdgJBs3DpmJ/the-univariate-fallacy
-https://www.lesswrong.com/posts/vhp2sW6iBhNJwqcwP/blood-is-thicker-than-water
-
-If you just had one integrated league, females wouldn't be competitive (in almost all sports, with some exceptions [like ultra-distance swimming](https://www.swimmingworldmagazine.com/news/why-women-have-beaten-men-in-marathon-swimming/)).
-
-]
-
-Given the empirical reality of the different multivariate trait distributions, "Who are the best athletes _among females_" is a natural question for people to be interested in, and want separate sports leagues to determine.
-
-(Similarly, when conducting [automobile races](https://en.wikipedia.org/wiki/Auto_racing), you want there to be rules enforcing that all competitors have the same type of car for some common-sense-reasonable operationalization of "the same type", because a race between a sports car and a [moped](https://en.wikipedia.org/wiki/Moped) would be mostly measuring who has the sports car, rather than who's the better racer.)
-
-Including males people in female sports leagues undermines the point of having a separate female league. 
-
-[TODO: more sentences explaining why HRT doesn't break taxonicity of sex, and why "gender identity" is a much less plausible joint anyway]
-
-[TODO: sentences about studies showing that HRT doesn't erase male advantage
-https://twitter.com/FondOfBeetles/status/1368176581965930501
-https://link.springer.com/article/10.1007/s40279-020-01389-3
-https://bjsm.bmj.com/content/55/15/865
-]
-
-[TODO sentences about Lia Thomas and Cece Tefler  https://twitter.com/FondOfBeetles/status/1466044767561830405 (Thomas and Tefler's feats occured after Yudkowsky's 2018 Tweets, but this kind of thing was easily predictable to anyone familiar with sex differences)
-https://www.dailymail.co.uk/news/article-10445679/Lia-Thomas-UPenn-teammate-says-trans-swimmer-doesnt-cover-genitals-locker-room.html
-]
-
-In light of these _empirical_ observations, Yudkowsky's suggestion that an ignorant comittment to an "Aristotelian binary" is the main reason someone might care about the integrity of women's sports, is revealed as an absurd strawman. This just isn't something any scientifically-literate person would write if they had actually thought about the issue _at all_, as contrasted to having _first_ decided (consciously or not) to bolster one's reputation among progressives by dunking on transphobes on Twitter, and wielding one's philosophy knowledge in the service of that political goal. The relevant empirical facts are _not subtle_, even if most people don't have the fancy vocabulary to talk about them in terms of "multivariate trait distributions".
-
-Yudkowsky's pretension to merely have been standing up for the distinction between facts and policy questions isn't credible: if you _just_ wanted to point out that the organization of sports leagues is a policy question rather than a fact (as if anyone had doubted this), why would you throw in the "Aristotelian binary" strawman and belittle the matter as "humorous"? There are a lot of issues that I don't _personally_ care much about, but I don't see anything funny about the fact that other people _do_ care.
-
-(And in this case, the empirical facts are _so_ lopsided, that if we must find humor in the matter, it really goes the other way. Lia Thomas trounces the entire field by _4.2 standard deviations_ (!!), and Eliezer Yudkowsky feels obligated to _pretend not to see the problem?_ You've got to admit, that's a _little_ bit funny.)
-
-----
-
-Having analyzed the _ways_ in which Yudkowsky is playing dumb here, what's still not entirely clear is _why_. Presumably he cares about maintaining his credibility as an insightful and fair-minded thinker. Why tarnish that by putting on this haughty performance?
-
-Of course, presumably he _doesn't_ think he's tarnishing it—but why not? [He graciously explains in the Facebook comments](https://www.facebook.com/yudkowsky/posts/10159421750419228?comment_id=10159421833274228&reply_comment_id=10159421901809228):
-
-> it is sometimes personally prudent and not community-harmful to post your agreement with Stalin about things you actually agree with Stalin about, in ways that exhibit generally rationalist principles, especially because people do _know_ they're living in a half-Stalinist environment [...] I think people are better off at the end of that.
-
-Ah, _prudence_! He continues:
-
-> I don't see what the alternative is besides getting shot, or utter silence about everything Stalin has expressed an opinion on including "2 + 2 = 4" because if that logically counterfactually were wrong you would not be able to express an opposing opinion.
-
-The problem with trying to "exhibit rationalist principles" in an line of argument that you're constructing in order to be prudent and not community-harmful, is that you're thereby necessarily _not_ exhibiting the central rationalist principle that what matters is the process that _determines_ your conclusion, not the reasoning you present to _reach_ your presented conclusion, after the fact.
-
-The best explanation of this I know was authored by Yudkowsky himself in 2007, in a post titled ["A Rational Argument"](https://www.lesswrong.com/posts/9f5EXt8KNNxTAihtZ/a-rational-argument). It's worth quoting at length. The Yudkowsky of 2007 invites us to consider the plight of a political campaign manager:
-
-> As a campaign manager reading a book on rationality, one question lies foremost on your mind: "How can I construct an impeccable rational argument that Mortimer Q. Snodgrass is the best candidate for Mayor of Hadleyburg?"
->
-> Sorry. It can't be done.
->
-> "What?" you cry. "But what if I use only valid support to construct my structure of reason? What if every fact I cite is true to the best of my knowledge, and relevant evidence under Bayes's Rule?"
->
-> Sorry. It still can't be done. You defeated yourself the instant you specified your argument's conclusion in advance.
-
-The campaign manager is in possession of a survey of mayoral candidates on which Snodgrass compares favorably to other candidates, except for one question. The post continues (bolding mine):
-
-> So you are tempted to publish the questionnaire as part of your own campaign literature ... with the 11th question omitted, of course.
->
-> **Which crosses the line between _rationality_ and _rationalization_.** It is no longer possible for the voters to condition on the facts alone; they must condition on the additional fact of their presentation, and infer the existence of hidden evidence.
->
-> Indeed, **you crossed the line at the point where you considered whether the questionnaire was favorable or unfavorable to your candidate, before deciding whether to publish it.** "What!" you cry. "A campaign should publish facts unfavorable to their candidate?" But put yourself in the shoes of a voter, still trying to select a candidate—why would you censor useful information? You wouldn't, if you were genuinely curious. If you were flowing _forward_ from the evidence to an unknown choice of candidate, rather than flowing _backward_ from a fixed candidate to determine the arguments.
-
-The post then briefly discusses the idea of a "logical" argument, one whose conclusions follow from its premises. "All rectangles are quadrilaterals; all squares are quadrilaterals; therefore, all squares are rectangles" is given as an example of _illogical_ argument, even though the both premises are true (all rectangles and squares are in fact quadrilaterals) _and_ the conclusion is true (all squares are in fact rectangles). The problem is that the conclusion doesn't _follow_ from the premises; the _reason_ all squares are rectangles isn't _because_ they're both quadrilaterals. If we accepted arguments of the general _form_ "all A are C; all B are C; therefore all A are B", we would end up believing nonsense.
-
-Yudkowsky's conception of a "rational" argument—at least, Yudkowsky's conception in 2007, which the Yudkowsky of the current year seems to disagree with—has a similar flavor: the stated reasons should be the actual reasons. The post concludes:
-
-> If you really want to present an honest, rational argument _for your candidate_, in a political campaign, there is only one way to do it:
->
-> * _Before anyone hires you_, gather up all the evidence you can about the different candidates.
-> * Make a checklist which you, yourself, will use to decide which candidate seems best.
-> * Process the checklist.
-> * Go to the winning candidate.
-> * Offer to become their campaign manager.
-> * When they ask for campaign literature, print out your checklist.
->
-> Only in this way can you offer a _rational_ chain of argument, one whose bottom line was written flowing _forward_ from the lines above it. Whatever _actually_ decides your bottom line is the only thing you can _honestly_ write on the lines above.
-
-I remember this being pretty shocking to read back in 'aught-seven. What an alien mindset! But it's _correct_. You can't rationally argue "for" a chosen conclusion, because only the process you use to _decide what to argue for_ can be your real reason.
-
-This is a shockingly high standard for anyone to aspire to live up to—but what made the Yudkowsky's Sequences so life-changingly valuable, was that they articulated the _existence_ of such a standard. For that, I will always be grateful.
-
-... which is why it's so _bizarre_ that the Yudkowsky of the current year acts like he's never heard of it! If your _actual_ bottom line is that it is sometimes personally prudent and not community-harmful to post your agreement with Stalin, then sure, you can _totally_ find something you agree with to write on the lines above! Probably something that "exhibits generally rationalist principles", even! It's just that any rationalist who sees the game you're playing is going to correctly identify you as a partisan hack on this topic and take that into account when deciding whether they can trust you on other topics.
-
-"I don't see what the alternative is besides getting shot," Yudkowsky muses (where presumably, 'getting shot' is a metaphor for a large negative utility, like being unpopular with progressives). Yes, an astute observation! And _any other partisan hack could say exactly the same_, for the same reason. Why does the campaign manager withhold the results of the 11th question? Because he doesn't see what the alternative is besides getting shot.
-
-Yudkowsky [sometimes](https://www.lesswrong.com/posts/K2c3dkKErsqFd28Dh/prices-or-bindings) [quotes](https://twitter.com/ESYudkowsky/status/1456002060084600832) _Calvin and Hobbes_: "I don't know which is worse, that everyone has his price, or that the price is always so low."
-
-If the idea of being fired from the Snodgrass campaign or being unpopular with progressives is _so_ terrifying to you that it seems analogous to getting shot, then, if those are really your true values, then sure—say whatever you need to say to keep your job and your popularity, as is personally prudent. You've set your price. But if the price you put on the intellectual integrity of your so-called "rationalist" community is similar to that of the Snodgrass for Mayor campaign, you shouldn't be surprised if intelligent, discerning people accord similar levels of credibility to the two groups' output.
-
-I see the phrase "bad faith" thrown around more than I think people know what it means. "Bad faith" doesn't mean "with ill intent", and it's more specific than "dishonest": it's [adopting the surface appearance of being moved by one set of motivations, while actually acting from another](https://en.wikipedia.org/wiki/Bad_faith).
-
-For example, an [insurance company employee](https://en.wikipedia.org/wiki/Claims_adjuster) who goes through the motions of investigating your claim while privately intending to deny it might never consciously tell an explicit "lie", but is definitely acting in bad faith: they're asking you questions, demanding evidence, _&c._ in order to _make it look like_ you'll get paid if you prove the loss occurred—whereas in reality, you're just not going to be paid. Your responses to the claim inspector aren't completely casually _inert_: if you can make an extremely strong case that the loss occurred as you say, then the claim inspector might need to put some effort into coming up with some ingenious excuse to deny your claim in ways that exhibit general claim-inspection principles. But at the end of the day, the inspector is going to say what they need to say in order to protect the company's loss ratio, as is personally prudent.
-
-With this understanding of bad faith, we can read Yudkowsky's "it is sometimes personally prudent [...]" comment as admitting that his behavior on politically-charged topics is in bad faith—where "bad faith" isn't a meaningless insult, but [literally refers](http://benjaminrosshoffman.com/can-crimes-be-discussed-literally/) to the pretending-to-have-one-set-of-motivations-while-acting-according-to-another behavior, such that accusations of bad faith can be true or false. Yudkowsky will never consciously tell an explicit "lie", but he'll go through the motions to _make it look like_ he's genuinely engaging with questions where I need the right answers in order to make extremely impactful social and medical decisions—whereas in reality, he's only going to address a selected subset of the relevant evidence and arguments that won't get him in trouble with progressives.
-
-To his credit, he _will_ admit that he's only willing to address a selected subset of arguments—but while doing so, he claims an absurd "confidence in [his] own ability to independently invent everything important that would be on the other side of the filter and check it [himself] before speaking" while _simultaneously_ blatantly mischaracterizing his opponents' beliefs! ("Gendered Pronouns For Everyone and Asking To Leave The System Is Lying" doesn't pass anyone's [ideological Turing test](https://www.econlib.org/archives/2011/06/the_ideological.html).)
-
-Counterarguments aren't completely causally _inert_: if you can make an extremely strong case that Biological Sex Is Sometimes More Relevant Than Self-Declared Gender Identity, Yudkowsky will put some effort into coming up with some ingenious excuse for why he _technically_ never said otherwise, in ways that exhibit generally rationalist principles. But at the end of the day, Yudkowsky is going to say what he needs to say in order to protect his reputation, as is personally prudent.
-
-Even if one were to agree with this description of Yudkowsky's behavior, it doesn't immediately follow that Yudkowsky is making the wrong decision. Again, "bad faith" is meant as a literal description, not a contentless attack—maybe there are some circumstances in which engaging some amount of bad faith is the right thing to do, given the constraints one faces. For example, when talking to people on Twitter with a very different ideological background from me, I sometimes anticipate that if my interlocutor knew what I was actually thinking, they wouldn't want to talk to me, so I take care to word my replies in a way that makes it look like I'm more ideologically aligned with them than I actually am. (For example, I [never say "assigned female/male at birth" in my own voice on my own platform](/2019/Sep/terminology-proposal-developmental-sex/), but I'll do it in an effort to speak my interlocutor's language.) I think of this as the _minimal_ amount of strategic bad faith needed to keep the conversation going, to get my interlocutor to evaluate my argument on its own merits, rather than rejecting it for coming from an ideological enemy. In cases such as these, I'm willing to defend my behavior as acceptable—there _is_ a sense in which I'm being deceptive by optimizing my language choice to make my interlocutor make bad guesses about my ideological alignment, but I'm comfortable with that amount and scope of deception because I don't think my interlocutor _should_ be paying attention to my personal alignment.
-
-That is, my bad faith Twitter gambit of deceiving people about my ideological alignment in the hopes of improving the discussion seems like something that makes our collective beliefs about the topic-being-argued-about _more_ accurate. (And the topic-being-argued-about is presumably of greater collective interest than which "side" I personally happen to be on.)
-
-In contrast, Yudkowsky's bad faith gambit is the exact reverse: he's making the discussion worse in the hopes of correcting people's beliefs about his own ideological alignment. (He's not a right-wing Bad Guy, but people would tar him as a right-wing Bad Guy if he ever said anything negative about trans people.) This doesn't improve our collective beliefs about the topic-being-argued about; it's a _pure_ ass-covering move.
-
-Yudkowsky names the alleged fact that "people do _know_ they're living in a half-Stalinist environment" as a mitigating factor. But the _reason_ censorship is such an effective tool in the hands of dictators like Stalin is because it ensures that many people _don't_ know—and that those who know (or suspect) don't have [game-theoretic common knowledge](https://www.lesswrong.com/posts/9QxnfMYccz9QRgZ5z/the-costly-coordination-mechanism-of-common-knowledge#Dictators_and_freedom_of_speech) that others do too.
-
-Zvi Mowshowitz has [written about how the false assertion that "everybody knows" something](https://thezvi.wordpress.com/2019/07/02/everybody-knows/) is typically used justify deception: if "everybody knows" that we can't talk about biological sex (the reasoning goes), then no one is being deceived when our allegedly truthseeking discussion carefully steers clear of any reference to the reality of biological sex when it would otherwise be extremely relevant.
-
-But if it were _actually_ the case that everybody knew (and everybody knew that everybody knew), then what would be the point of the censorship? It's not coherent to claim that no one is being harmed by censorship because everyone knows about it, because the entire appeal and purpose of censorship is precisely that _not_ everybody knows and that someone with power wants to _keep_ it that way.
-
-For the savvy people in the know, it would certainly be _convenient_ if everyone secretly knew: then the savvy people wouldn't have to face the tough choice between
-acceding to Power's demands (at the cost of deceiving their readers) and informing their readers (at the cost of incurring Power's wrath).
-
-Policy debates should not appear one-sided. Faced with this kind of dilemma, I can't say that defying Power is necessarily the right choice: if there really _were_ no other options between deceiving your readers with a bad faith performance, and incurring Power's wrath, and Power's wrath would be too terrible to bear, then maybe deceiving your readers with a bad faith performance is the right thing to do.
-
-But if you actually _cared_ about not deceiving your readers, you would want to be _really sure_ that those _really were_ the only two options. You'd [spend five minutes by the clock looking for third alternatives](https://www.lesswrong.com/posts/erGipespbbzdG5zYb/the-third-alternative)—including, possibly, not issuing proclamations on your honor as leader of the so-called "rationalist" community on topics where you _explicitly intend to ignore counteraguments on grounds of their being politically unfavorable_. Yudkowsky rejects this alternative on the grounds that it allgedly implies "utter silence about everything Stalin has expressed an opinion on including '2 + 2 = 4' because if that logically counterfactually were wrong you would not be able to express an opposing opinion", but this seems like yet another instance of Yudkowsky playing dumb: if he _wanted_ to, I'm sure Eliezer Yudkowsky could think of _some relevant differences_ between "2 + 2 = 4" (a trivial fact of arithmetic) and "the simplest and best protocol is, "'He' refers to the set of people who have asked us to use 'he'" (a complex policy proposal whose flaws I have analyzed in detail above).
-
-
-
-["People are better off at the end of that"— _who_ is better off? We need a conflict-theoretic analysis]
-
-[I was going to save the Whole Dumb Story for a different post, and keep this post narrowly scoped to just critiquing the Feb. 2021 pronouns post, but in order to explain the problem with "Everybody knows" and "People are better off after that", I need to breifly summarize the context of this discussion which explains why _I_ didn't know and _I'm_ not better off—with the understanding that this only a brief summary, and I might tell the long version in a separate post—if it's still necessary, relative to everything else I need to get around to writing]
-
-[I _never_ expected to end up arguing about the mintuiae of pronoun conventions; I wanted to talk about the real issues]
-
-[It all started back in the 'aughts, when the occasional things about sex differences that cropped up in the Sequences (especially "Changing Emotions" and the Extropians mailing list pre) turned out to be useful for understanding what was going on with my gender thing; I wrote about this in a previous post, ["Sexual Dimorphism in Yudkowsky's Sequences, in Relation to my Gender Problems"](/2021/May/sexual-dimorphism-in-the-sequences-in-relation-to-my-gender-problems/).]
-
-[But that was all about me—I assumed "trans" was a different thing. My first clue that I might not be living in that world came from—Eliezer Yudkowsky, with the "at least 20% of the ones with penises are actually women" thing]
-
-[So I ended up arguing with people about the two-type taxonomy, and I noticed that those discussions kept getting _derailed_ on some variation of "The word woman doesn't actually mean that". So I took the bait, and starting arguing against that, and then Yudkowsky comes back to the subject with his "Hill of Validity in Defense of Meaning"—and I go on a philosophy of language crusade, and Yudkowsky eventually clarifies, and _then_ he comes back _again_ in Feb. 2022 with his "simplest and best protocol"]
-
-[At this point, the nature of the game is very clear. Yudkowsky wants to mood-affiliate with being on the right side of history, subject to the constraint of not saying anything false. I want to actually make sense of what's actually going on in the world, because _I need the correct answer to decided whether or not to cut my dick off_. On "his turn", he comes up with some pompous proclamation that's optimized to make the "pro-trans" faction look smart and good and the "anti-trans" faction look dumb and bad, "in ways that exhibit generally rationalist principles." On my turn, I put in an absurd amount of effort explaining in exhaustive, _exhaustive_ detail why Yudkowsky's pompous proclamation was substantively misleading as constrated to what you would say if you were actually trying to make sense of the world.]
-
-[nearest unblocked strategy; I would prefer to have a real discussion under the assumption of good faith, but _I tried that first_. Object-level disucssion with Yudkowsky is a waste of time as long as he's going to play these games; there's nothing left for me to do but jump up a meta level and explain, to anyone who capable of hearing it, why in this case the assumption of good faith has been empirically falsified]
-
-[If it were _actually true_ that there was no harm from the bad faith because people know they're living in a half-Stalinist environment, then he wouldn't have tried to get away with the "20% of the ones with penises" thing]
-
-[All this despite the fact that all my heretical opinions are _literally_ just his opinions from the 'aughts. Seriously, you think I'm smart enough to come up with all of this indepedently? I'm not! I ripped it all off from Yudkowsky back in the 'aughts when he still gave a shit about telling the truth in this domain. Does he expect us not to notice? Well, I guess it's been working out for him so far.]
-
-[Agreeing with Stalin that 2+2=4 is fine; the problem is a sustained pattern of _selectively_ bring up pro-Party points while ignoring anti-Party facts that would otherwise be relevant to the topic of interest, including stonewalling commenters who try to point out relevance; I think I'm doing better: I can point to places where I argue "the other side", because I know that sides are fake]
-
-[I can win concessions, like "On the Argumentative Form", but I don't want concessions; I want to _actually get the goddamned right answer_]
-
--------
-
-[Why does this matter? It would be dishonest for me to claim that this is _directly_ relevant to xrisk, because that's not my real bottom line]
-
-a rationality community that can't think about _practical_ issues that affect our day to day lives, but can get existential risk stuff right, is like asking for self-driving car software that can drive red cars but not blue cars
-
-It's a _problem_ if public intellectuals in the current year need to pretend to be dumber than seven-year-olds in 2016
-
-https://www.readthesequences.com/
-> Because it is all, in the end, one thing. I talked about big important distant problems and neglected immediate life, but the laws governing them aren't actually different.
-
-> the challenge is almost entirely about high integrity communication by small groups
-https://twitter.com/HiFromMichaelV/status/1486044326618710018