Monday morning tap at "Challenges"

[Ultimately_Untrue_Thought.git] / content / drafts / challenges-to-yudkowskys-pronoun-reform-proposal.md
diff --git a/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md b/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md

index bda0d11..2f2c056 100644 (file)
--- a/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md
+++ b/content/drafts/challenges-to-yudkowskys-pronoun-reform-proposal.md
@@ -79,17 +79,41 @@ But given that pronouns _do_ convey sex-category information, as a _fact_ about
  
  In an article titled ["Pronouns are Rohypnol"](https://fairplayforwomen.com/pronouns/), Barra Kerr compares preferred pronouns to the famous [Stroop effect](https://en.wikipedia.org/wiki/Stroop_effect). When color words are printed in text of a different color (_e.g._, <span style="color:blue;">red</span>, <span style="color:green">orange</span>, <span style="color:red">yellow</span>, <span style="color:purple">green</span>, <span style="color:orange">blue</span>, _&c._) and people are asked to name the color of the text, they're slow to respond: the meaning of the word interferes with their ability to name the color in front of our eyes.
  
  
  In an article titled ["Pronouns are Rohypnol"](https://fairplayforwomen.com/pronouns/), Barra Kerr compares preferred pronouns to the famous [Stroop effect](https://en.wikipedia.org/wiki/Stroop_effect). When color words are printed in text of a different color (_e.g._, <span style="color:blue;">red</span>, <span style="color:green">orange</span>, <span style="color:red">yellow</span>, <span style="color:purple">green</span>, <span style="color:orange">blue</span>, _&c._) and people are asked to name the color of the text, they're slow to respond: the meaning of the word interferes with their ability to name the color in front of our eyes.
  
-Kerr suggests that preferred pronouns have a similar effect, that "a conflict between what we see [...] and what we are expected to say, affects us." As an exercise, she suggests (privately!) translating sentences about transgender people to use natal-sex-based pronouns, and honestly asking oneself: "Do you feel differently, on reading it this way? Do you react differently?"
+Kerr suggests that preferred pronouns have a similar effect, that "a conflict between what we see and know to be true, and what we are expected to say, affects us." As an exercise, she suggests (privately!) translating sentences about transgender people to use natal-sex-based pronouns.
+
+Unfortunately, I don't have a study with objective measurements on hand (let me know in the comments if you do!), but I think most native English speakers who try this exercise and introspect—especially using examples where the trans person exhibits features or behavior typical of their natal sex—will agree with Kerr's assessment: "You can know perfectly the actual sex of a male person, and yet you will still react differently if someone calls them _she_ instead of _he_."
+
+Let's relate this is Yudkowsky's specialty of artificial intelligence. In a post on ["Multimodal Neurons in Artificial Neural Networks"](https://openai.com/blog/multimodal-neurons/), Gabriel Goh _et al._ explore the capabilities and biases of the [CLIP](https://openai.com/blog/clip/) neural network trained on textual and image data.
+
+There are some striking parallels between CLIP's behavior, and phenomena observed in neuroscience. Neurons in the human brain have been observed to respond to the same concept represented in different modalities (_e.g._, [Quiroga _et al._](/papers/quiroga_et_al-invariant_visual_representation_by_single_neurons.pdf) observed a neuron in one patient that responded to photos and sketches of actress Halle Berry, as well as the text string "Halle Berry"), and so do CLIP neurons. Futhermore, CLIP is vulnerable to a Stroop-like effect where its image-classification capabilities can be fooled by "typographic attacks"—a dog with instances of the text "$$$" superimposed over it gets classified as a piggy bank, an apple with a handwritten sign saying "LIBRARY" gets classified as a library. The network knows perfectly what dogs and apples look like under ordinary circumstances, and yet still reacts differently when presented with clashing textual labels.
+
+I conjecture that the appeal of subject-chosen pronouns lies _precisely_ in how they exert Stroop-like effects on speakers' cognition. (Once again, if it were _actually true_ that _she_ and _he_ had no difference in meaning, _there would be no reason to care_.) [Pronoun badges](/2018/Oct/sticker-prices/) are, quite literally, a typographic attack against native English speakers' brains.
+
+Note, I mean this as a value-free description of how the convention _actually functions_ in the real world, [not a condemnation](https://www.lesswrong.com/posts/N9oKuQKuf7yvCCtfq/can-crimes-be-discussed-literally). One could consistently hold that these "attacks" are morally good—
+
+
+Is susceptibility to Stroop-like effects an indication of bad mind design? I mean, maybe! You could argue that! One would expect that an _intelligently_-designed agent (as contrasted to messy human brains coughed up [blind evolution](https://www.lesswrong.com/posts/jAToJHtg39AMTAuJo/evolutions-are-stupid-but-work-anyway) or [lucky](https://www.lesswrong.com/posts/dpzLqQQSs7XRacEfK/understanding-the-lottery-ticket-hypothesis) neural networks found by gradient descent) could easily bind and re-bind symbols on the fly: 
+
+
  
  
-[TODO: I have a marketing problem here; the fact that Kerr chose a sexual violence example is actually kind of important here; if the sentence was about borrowing vacuum cleaners, then people in Berkeley _will_ play dumb]
  
  
-Unfortunately, I don't have a study with objective measurements on hand (let me know in the comments if you do!) but I think native English speakers who try this exercise and introspect will agree with Barr's assessment: "You can know perfectly the actual sex of a male person, and yet you will still react differently if someone calls them _she_ instead of _he_."
  
  
-[TODO: Contrary to Yudkowskys' claims about lies, Kerr _isn't_ claiming that pronouns can be "lies"; the article is _very_ explicit about this; Yudkowsky is obviously completely unfamiliar with his opponents' arguments]
  
  [TODO: let's related this to Yudkowsky's specialty multimodal neurons— both CLIP and biological neurons respond to text/images; typographic attacks are the same thing as pronoun badges; you would expect the people aligning language models to be able to think these thoughts]
  
  
  [TODO: let's related this to Yudkowsky's specialty multimodal neurons— both CLIP and biological neurons respond to text/images; typographic attacks are the same thing as pronoun badges; you would expect the people aligning language models to be able to think these thoughts]
  
-Given this multitude of reasons why the _existing_ meanings of _she_ and _he_ are relevant to the question of pronoun reform, what is Yudkowsky's response?
+Importantly, Kerr is _explicitly_ appealing to psychological effects of different pronoun conventions. She is absolutely _not_ claiming that the use of preferred pronouns is itself a "lie" about some testable proposition. She writes:
+
+> I've heard many people tell me they don't mind doing this, as a courtesy, although it takes some effort to keep up the mental gymnastics of perceiving one sex, but consistently using pronouns for the other. That's a personal choice, and I respect the reasons why some people make it.
+
+> I've also heard many people declaring that anyone who won't comply (usually directed at a woman) is obnoxious, mean, hostile, and unpleasant. 'Misgendering' is hate speech. They say.
+
+> But I refuse to use female pronouns for anyone male.
+
+Note the wording: "That's a personal choice", "_I_ refuse". She knows perfectly well that people who use gender-identity-based pronouns aren't making a false claim that trans men produce sperm, _&c._! Rather, she's saying that a pronoun convention that groups together females, and a minority of males who wish they were female, affects our cognition about that minority of males in a way that's disadvantageous to Kerr's interests (because she wants to be especially alert to threats posed by males), such that Kerr refuses to comply with that convention in her own speech. (Compare to how a Spanish speaker might refuse to address someone they disrespected as _usted_ because of its connotations, without thereby claiming that using _usted_ would make the sentence literally false.)
+
+I take pains to emphasize this because Yudkowsky [misrepresents what his political opponents are typically claiming](https://slatestarcodex.com/2014/05/12/weak-men-are-superweapons/), repeatedly trying to frame the matter of dispute as to whether pronouns can be "lies" (to which Yudkowsky says, No, that would be ontologically confused)—whereas if you _actually read_ what the people on the other side of the policy debate are saying, they're largely _not claiming_ that "pronouns are lies"! (It seems fair to regard Kerr's article as representative of gender-critical ("TERF") concerns; I've seen the post linked in those circles more than once, and it's cited in [embattled former University of Sussex professor Kathleen Stock](https://en.wikipedia.org/wiki/Kathleen_Stock#Views_on_gender_self-identification)'s book _Material Girls_.)
+
+Anyway, given these reasons why the _existing_ meanings of _she_ and _he_ are relevant to the question of pronoun reform, what is Yudkowsky's response?
  
  Apparently, to play dumb. In the comments of the Facebook post, Yudkowsky claims:
  
  
  Apparently, to play dumb. In the comments of the Facebook post, Yudkowsky claims:
  
@@ -97,16 +121,20 @@ Apparently, to play dumb. In the comments of the Facebook post, Yudkowsky claims
  
  ...
  
  
  ...
  
-I'm sorry, but I can't take this self-report literally. I certainly [don't think Yudkowsky was _consciously_ lying](https://www.lesswrong.com/posts/bSmgPNS6MTJsunTzS/maybe-lying-doesn-t-exist) when he wrote that. Nevertheless, I am _incredibly_ skeptical that Yudkowsky _actually_ doesn't know what it feels like from the inside to feel like a pronoun is attached to sex more firmly than a proper name is attached to someone's appearance.
+I'm sorry, but I can't take this self-report literally. I certainly [don't think Yudkowsky was _consciously_ lying](https://www.lesswrong.com/posts/bSmgPNS6MTJsunTzS/maybe-lying-doesn-t-exist) when he wrote that. (When speaking or writing quickly without taking the time to scrupulously check, [it's common for little untruths and distortions to slip into one's speech](https://www.lesswrong.com/posts/pZSpbxPrftSndTdSf/honesty-beyond-internal-truth).) Nevertheless, I am _incredibly_ skeptical that Yudkowsky _actually_ doesn't know what it feels like from the inside to feel like a pronoun is attached to sex more firmly than a proper name is attached to someone's appearance.
  
  
-[TODO: how could you possibly know that?]
+I realize this must seem impossibly rude and presumptuous of me. Yudkowsky _said_ he doesn't know what it feels like from the inside! That's a report out his own mental state, which he has privileged introspective access to, and I don't! What grounds could I possibly, _possibly_ have to think he's not telling the truth about his own mind? 
+
+It's a good question. And my answer is, even without mind-reading technology, people's minds are still part of the same cause-and-effect physical universe that I can (must) make probabilistic inferences about, and verbal self-reports aren't my _only_ source of evidence about someone's mind. In particular, if someone's verbal self-report mis-predicts what we know about their _behavior_, it's far from clear that we should trust the report more than our senses.
  
  The thing is, Eliezer Yudkowsky is a native English speaker born in 1979. As a native English speaker born in 1987, I have a _pretty good_ mental model of how native English speakers born in the late 20th century use language.
  
  And one of the things native English speakers born in the late 20th century are _very good_ at doing, is noticing what sex people are and using the corresponding pronouns without consciously thinking about it, because the pronouns are attached to the concept of sex in their heads more firmly than proper names are attached to something in their heads.
  
  
  The thing is, Eliezer Yudkowsky is a native English speaker born in 1979. As a native English speaker born in 1987, I have a _pretty good_ mental model of how native English speakers born in the late 20th century use language.
  
  And one of the things native English speakers born in the late 20th century are _very good_ at doing, is noticing what sex people are and using the corresponding pronouns without consciously thinking about it, because the pronouns are attached to the concept of sex in their heads more firmly than proper names are attached to something in their heads.
  
+I would bet at very generous odds at some point in his four decades on Earth, Eliezer Yudkowsky has used _she_ or _he_ on the basis of perceived sex to refer to someone whose name he didn't know. Because _all native English speakers do this_. Moreover, we can say something about the cognitive algorithm underlying _how_ they do this: [people can recognize sex from facial structure _alone_ (hair covered, males clean-shaven) at 96% accuracy](/papers/bruce_et_al-sex_discrimination_how_do_we_tell.pdf)
+
  
  
-I would bet at very generous odds at some point in his four decades on Earth, Eliezer Yudkowsky has used _she_ or _he_ on the basis of perceived sex to refer to someone whose name he didn't know. Because _all native English speakers do this_.
+I would also bet at very generous odds that in his four decades on Earth, Eliezer Yudkowsky has very rarely if ever assumed what someone's name is on the basis of their appearance without being told.
  
  
  
  
  
  
@@ -122,7 +150,7 @@ Okay, so Yudkowsky
  
  [TODO: self-identity is a Schelling point]
  
  
  [TODO: self-identity is a Schelling point]
  
-
+appeal to inner privacy conversation-halter https://www.lesswrong.com/posts/wqmmv6NraYv4Xoeyj/conversation-halters
  
  
  [OUTLINE of remainder—
  
  
  [OUTLINE of remainder—
@@ -133,7 +161,7 @@ Okay, so Yudkowsky
   * "It can't be based on feelings"—hypocrisy, the only reason we're talking about this at all is because of genderspecial people's feelings, as explicitly acknowledged in the OP!!!
   * "Can't imagine a sympathetic protagonist"—lies, imagine a rape victim
   * "If there were unspeakable arguments against, we couldn't talk about them"—okay, then you and your rationalists are frauds
   * "It can't be based on feelings"—hypocrisy, the only reason we're talking about this at all is because of genderspecial people's feelings, as explicitly acknowledged in the OP!!!
   * "Can't imagine a sympathetic protagonist"—lies, imagine a rape victim
   * "If there were unspeakable arguments against, we couldn't talk about them"—okay, then you and your rationalists are frauds
- * I know none of this matters, but one would have thought that the _general_ skills of correct argument would matter for saving the world ... right? / brief recap of my Whole Dumb Story, need the correct answer in order to decide
+ * I know none of this matters (If any professional alignment researchers wasting time reading this instead of figuring out how to save the world, get back to work!!), but one would have thought that the _general_ skills of correct argument would matter for saving the world.
  
  somewhere—
   * Douglas Hofstader also made fun of gendered pronouns with his "Person Paper"—but notice that he didn't even consider the self-chosen criterion!!
  
  somewhere—
   * Douglas Hofstader also made fun of gendered pronouns with his "Person Paper"—but notice that he didn't even consider the self-chosen criterion!!
@@ -176,6 +204,7 @@ Fit in somewhere—
  • typographic attacks https://openai.com/blog/multimodal-neurons/  
  • singular     
  
  • typographic attacks https://openai.com/blog/multimodal-neurons/  
  • singular     
  
+
  https://www.ehu.eus/seg/_media/gizt/5/5/brown-gilman-pronouns.pdf
  
  > In terms of important things?  Those would be all the things I've read - from friends, from strangers on the Internet, above all from human beings who are people - describing reasons someone does not like to be tossed into a Male Bucket or Female Bucket, as it would be assigned by their birth certificate, or perhaps at all.
  https://www.ehu.eus/seg/_media/gizt/5/5/brown-gilman-pronouns.pdf
  
  > In terms of important things?  Those would be all the things I've read - from friends, from strangers on the Internet, above all from human beings who are people - describing reasons someone does not like to be tossed into a Male Bucket or Female Bucket, as it would be assigned by their birth certificate, or perhaps at all.