Steven Pinker on Coordination, Common Knowledge, and the Retreat of Liberal Enlightenment (Ep. 255)

Steven Pinker returns to Conversations with Tyler with an argument that common knowledge—those infinite loops of “I know that you know that I know”—is the hidden infrastructure that enables human coordination, from accepting paper money to toppling dictators. But Tyler wonders: if most real-world coordination works fine without recursively looping (a glance at a traffic circle), if these models break down with the slightest change in assumptions, and if anonymous internet posters are making correct but uncomfortable truths common knowledge when society might function better with noble lies, is Pinker’s theory really capturing how coordination works—and might we actually need less common knowledge, not more?

Tyler and Steven probe these dimensions of common knowledge—Schelling points, differential knowledge, benign hypocrisies like a whisky bottle in a paper bag—before testing whether rational people can actually agree (spoiler: they can’t converge on Hitchcock rankings despite Aumann’s theorem), whether liberal enlightenment will reignite and why, what stirring liberal thinkers exist under the age 55, why only a quarter of Harvard students deserve A’s, how large language models implicitly use linguistic insights while ignoring linguistic theory, his favorite track on Rubber Soul, what he’ll do next, and more.

Watch the full conversation

Recorded September 12th, 2025.

Read the full transcript

TYLER COWEN: Hello everyone, and welcome back to Conversations with Tyler. Today I’m delighted to be chatting with Steven Pinker. Steven needs no introduction, per se, but I will point out, he has a new and excellent book called When Everyone Knows That Everyone Knows . . . The subtitle is Common Knowledge and the Mysteries of Money, Power, and Everyday Life. Steven, welcome.

STEVEN PINKER: Thank you.

COWEN: How important is the reasoning behind common knowledge? Say you get a bunch of men in the room, and at the end of some thinking, one of them concludes that he has spinach on his teeth, using common knowledge. Let’s say that mode of reasoning was somehow not allowed to us, how much lower would GDP be?

PINKER: It would tank. It would be almost nonexistent just because all of our economy depends on coordination, on people doing things that are in everyone’s interest if everyone does them the same way, beginning with money itself, with currency. Why do we accept a piece of paper in exchange for something of value? We know other people will accept it and give us something of value. Why do they do that? They know that still other people will. Everyone knows that everyone knows that money has value. That’s what gives it value, as long as they do.

That can unravel when you have hyperinflation, or a bank run, or a currency attack. Political power. No government can have snipers on every rooftop and telescreens in every room surveilling you 24/7 with the Stasi ready to pounce on you if you break a law. A lot of our laws, a lot of recognition of authority, who’s the boss, depend on just common knowledge, namely, he’s the boss or she’s the boss. She gets her way because she knows I’ll obey. I’ll obey because I know she gets her way. For as long as that lasts, as long as that common knowledge lasts, you have structures of authority and reporting and power and responsibility.

Again, since it depends on nothing but common expectations, it can unravel as when you have a big public demonstration where people see each other, seeing each other, seeing each other opposing the regime, and then the authority of the regime can evaporate.

On the difference between coordination and common knowledge

COWEN: Surely there’s a difference between coordination and common knowledge. I think of common knowledge as an extremely recursive model that typically has an infinite number of loops. Most of the coordination that goes on in the real world is not like that. If I approach a traffic circle in Northern Virginia, I look at the other person, we trade glances. There’s a slight amount of recursion, but I doubt if it’s ever three loops. Maybe it’s one or two.

We also have to slow down our speeds precisely because there are not an infinite number of loops. We coordinate. What percentage of the coordination in the real world is like the traffic circle example or other examples, and what percentage of it is due to actual common knowledge?

PINKER: Common knowledge, in the technical sense, does involve this infinite number of arbitrarily embedded beliefs about beliefs about beliefs. Thank you for introducing the title with the three dots, dot, dot, dot, because that’s what signals that common knowledge is not just when everyone knows that everyone knows, but when everyone knows that everyone knows that and so on. The answer to your puzzle — and I devote a chapter in the book to what common knowledge — could actually consist of, and I’m a psychologist, I’m not an economist, a mathematician, a game theorist, so foremost in my mind is what’s going on in someone’s head when they have common knowledge.

You’re right. We couldn’t think through an infinite number of “I know that he knows” thoughts, and our mind starts to spin when we do three or four. Instead, common knowledge can be generated by something that is self-evident, that is conspicuous, that’s salient, that you can witness at the same time that you witness other people witnessing it and witnessing you witnessing it. That can grant common knowledge in a stroke. Now, it’s implicit common knowledge.

One way of putting it is you have reason to believe that he knows that I know that he knows that I know that he knows, et cetera, even if you don’t literally believe it in the sense that that thought is consciously running through your mind. I think there’s a lot of interplay in human life between this recursive mentalizing, that is, thinking about other people thinking about other people, and the intuitive sense that something is out there, and therefore people do know that other people know it, even if you don’t have to consciously work that through.

You gave the example of norms and laws, like who yields at an intersection. The eye contact, though, is crucial because I suggest that eye contact is an instant common knowledge generator. You’re looking at the part of the person looking at the part of you, looking at the part of them. You’ve got instant granting of common knowledge by the mere fact of making eye contact, which is why it’s so potent in human interaction and often in other species as well, where eye contact can be a potent signal.

There are even species that can coordinate without literally having common knowledge. I give the example of the lowly coral, which presumably not only has no beliefs, but doesn’t even have a brain with which to have beliefs. Coral have a coordination problem. They’re stuck to the ocean floor. Their sperm have to meet another coral’s eggs and vice versa. They can’t spew eggs and sperm into the water 24/7. It would just be too metabolically expensive. What they do is they coordinate on the full moon.

On the full moon or, depending on the species, a fixed number of days after the full moon, that’s the day where they all release their gametes into the water, which can then find each other. Of course, they don’t have common knowledge in knowing that the other will know. It’s implicit in the logic of their solution to a coordination problem, namely, the public signal of the full moon, which, over evolutionary time, it’s guaranteed that each of them can sense it at the same time.

Indeed, in the case of humans, we might do things that are like coral. That is, there’s some signal that just leads us to coordinate without thinking it through. The thing about humans is that because we do have or can have recursive mentalizing, it’s not just one signal, one response, full moon, shoot your wad. There’s no limit to the number of things that we can coordinate creatively in evolutionarily novel ways by setting up new conventions that allow us to coordinate.

COWEN: I’m not doubting that we coordinate. My worry is that common knowledge models have too many knife-edge properties. Whether or not there are timing frictions, whether or not there are differential interpretations of what’s going on, whether or not there’s an infinite number of messages or just an arbitrarily large number of messages, all those can matter a lot in the model. Yet actual coordination isn’t that fragile. Isn’t the common knowledge model a bad way to figure out how coordination comes about?

PINKER: There could be a direct link between a public conspicuous signal and the desire to coordinate. I think you’re right. That’s probably what happens most of the time. In cases where there’s doubt as to whether this signal has been received or how it’s interpreted, or where there’s no way in which you can generate such a salient signal, that’s when we get what’s often interesting in life. Namely, you’re trying to get inside someone else’s head, and, “Do they know what I know or do they know that I know it?”

A lot of our not our routine automatic interactions, but the ones where we stop and think and calibrate our responses, are cases where recursive mentalizing does come into play. The choice of words in a fraught situation would be an example. A veiled bribe, a sexual come-on, a solicitation for a big favor like a donation, where you put some mental effort into exactly how do I frame my overture so that my hearer catches my drift without knowing that I know that they know, which could change the basis of a relationship.

By the way, I’ve skipped a step here, which is that I suggest that human relationships like friendship, like authority, like romance, like transaction partners are all coordination games that are solved by common knowledge. Sometimes, when we want not to change the relationship in force, we actually take steps to avoid common knowledge, as in innuendos and euphemisms, where the message gets through, but there’s enough uncertainty as to whether your hearer knows that you know that they know that the relationship can survive the threat from the overture.

COWEN: Doesn’t differential knowledge play a big role in helping us understand what will happen? Say I’m part of an oligarchic elite, and we all decide the actual lead dictator is about to fall. That becomes common knowledge within the oligarchic elite. That’s not enough. Someone has to move first. Someone has to have differential knowledge that maybe they can take over, or maybe they’re the more courageous one, or maybe they know who in the armed forces will support them. The actual model of cooperation is this optimal blend of differential and common rather than just common knowledge. No?

PINKER: Yes, I think that’s right. Often, once they make their move, they generate the common knowledge that allows everyone to coordinate, but there has to be some kind of either private knowledge or private initiative that launches the proliferation of common knowledge. I start off with the story of the emperor’s new clothes. Why did that little boy blurt out that the emperor was naked, which then generated the common knowledge? Now everyone knows that everyone knows. Why him? Why then?

I think part of the understanding of the story is that children, being naive and innocent about certain conventions and norms that govern adult life, often do blurt out truths; they don’t have the same internal payoff structure that adults do. They don’t have the fear of ostracism or condemnation that adults do.

COWEN: For a while now, we’ve had this thing called anonymous posters on the internet. Do you think, on average, they make the world better or worse?

PINKER: Interesting. Probably worse.

COWEN: They help make some things common knowledge. If there are things that are not common knowledge, some anonymous poster will say it. There’ll also be misinformation. Is that a clue that maybe at the margin, we have too much common knowledge rather than too little?

PINKER: It’s a good question. The way you posed the question is, on average, is it better or worse? It’s kind of a matter of counting up the cases where it is a little boy saying the emperor was naked and, unlike the story, using anonymity to be able to escape reprisals, punishment for generating the common knowledge. You’d have to count up the number of cases where that happens compared to all of the misinformation, trolling, hiding behind reprisals as a way of being aggressive.

I think a lot of the shaming mobs, a lot of the trolling, a lot of the degradation of discourse in social media comes because of that anonymity. Is it worth it because sometimes it is a little boy saying the emperor’s naked? That would be harder to tell.

COWEN: Take even the cases where the anonymous posters are right. As you and I know, there’s a lot of alt-right views out there. In general, I don’t agree with them. I think they’re quite wrong. I also think that if some of them were correct, some of the conspiracy theories, it would actually be better if we don’t all know them, that we would lose too much faith in elites. It’s better that we do not have too much common knowledge, again, if those claims were true. Do you agree?

PINKER: I guess you’d have to set up the hypothetical. That is, are there cases where there’s a noble lie, where we’re better off if everyone believes it? I think a somewhat different case, other than an utter lie, would be a genteel, benign hypocrisy, where you have a common pretense. Your sometime-collaborator Robin Hanson writes about that in his book with Kevin Simler, where he gives some examples, and I actually repeat them and add some of my own, of cases where it’s not an outright lie, but it’s a case where there’s some things that are true that you avoid saying.

Robin Hanson gives examples like if someone drinks from a whiskey bottle in a paper bag on a park bench, the paper bag isn’t really fooling anyone, but the police may choose not to arrest him under the pretense that he’s not drinking an alcoholic beverage in public. Why would they do that? If they try to enforce the letter of the law, it would be too much like a police state. It’d be a waste of resources.

On the other hand, if they fail to arrest someone who is overtly drinking, they’d be surrendering the authority of the rule of law in general. The paper bag, what it does is it allows them to say the rule of law still holds in general, even if we choose, as we say, and it’s an interesting idiom, to look the other way. There are other cases in law enforcement that work like that. Escort services. Does anyone really believe that someone’s paying only for an escort, but they get away with it? Tobacco paraphernalia, which is basically drug paraphernalia.

Of course, it’s a question of where you draw the line so that the law doesn’t forfeit all of its authority. By which I mean, and going back to the top of the conversation, the common knowledge that the rule of law applies, which is necessary given that the police can’t enforce everything 24/7. Then on the international stage, things like, is Taiwan a sovereign state? Of course it is, but the official policy of the United States is that it doesn’t exist, and of pretty much every other country, as far as I know.

COWEN: Taiwan included, in a funny way.

PINKER: Taiwan included, yes. I don’t know if that would be an example. It’s not exactly a noble lie, but it’s a common pretense. Again, you can spell out the reason, although if you spell out the reason, then that undermines the reason. The reason, of course, being that it would be considered an affront, a challenge to China’s sense of authority, prestige, standing, preeminence, because it stakes a claim to all of historic China, which, if it was explicitly breached by formally recognizing Taiwan, it would be a challenge to China’s authority. If nations act as if it was a sovereign state without saying it’s a sovereign state, then China can maintain the pretense that its claim is valid.

Yet another example is, does Israel possess nuclear weapons? Their official policy is, “We’re not saying yes, we’re not saying no.” Now, everyone knows that Israel has nuclear weapons. If there wasn’t some suspicion, the weapons would be useless because they wouldn’t be a deterrent. On the other hand, if it was part of the official policy that Israel was a nuclear state, then its Arab rivals would feel that they had to acquire them as well so as not to lose face and standing.

COWEN: Say determinism were true and free will were false, it would be disastrous if we all believed that. I think it’s a good thing there’s not more common knowledge because I think determinism is true.

PINKER: That’s a whole other discussion. One could say that determinism is necessary for the incentive structure that holding people responsible imposes. Namely, if I condemn you, punish you, shame you, hold you responsible for bad choices, you’re less likely to make those bad choices. That’s why we have a regime of holding people responsible. Paradoxically, you need some degree of determinism for the idea of holding people responsible for their choices to have any effect.

On Schelling points vs. common knowledge

COWEN: Is a Schelling point the same thing as common knowledge in your approach?

PINKER: A Schelling point, I think it’s more like common salience. It’s closely related. A Schelling point is a choice that people make to coordinate that has no intrinsic value other than the fact that everyone else knows that it exists. It’s something that pops out. Schelling’s original example is a couple separated in New York. This is before the era of cellphones. How do they coordinate in the absence of common knowledge? It can’t just be that each tries to guess where the other one would go, because if he knows that she likes to go to the bookstore, so he goes to the bookstore, the problem might be that she knows he likes to go to the camera store, so she might go to the camera store. He knows that, so maybe he should go to the camera store, but then she knows that he knows that she likes to go to the bookstore, so she might think, “Oh, he’s going to go to the bookstore after all, ad infinitum.”

If there’s some point in the landscape that commends itself as something that is salient enough, obvious enough, recommends itself, like in Schelling’s example, he guessed that the couple would meet at the clock in the middle of Grand Central Station at noon, even if it wasn’t where they would have been separated, simply because if you have to pick some place, what place are you going to pick? A Schelling point is something that is salient enough to enough people that they alight on it as a solution to a coordination problem in the absence of common knowledge, per se.

COWEN: Steven, where would you go in New York to actually meet them? I know where you’d go to meet me, Grand Central Station.

PINKER: It would pop into our minds, and each of us knows it would pop into the other’s mind because we both read Tom Schelling. Nowadays, Grand Central Station isn’t what it used to be, and it’s quite possible that it wouldn’t be a Schelling point. That is, a point that serves as a coordination equilibrium simply because each one knows that the other one knows that it will pop into the other one’s mind. Schelling actually addresses that. He wrote in 1960 when Grand Central was much more influential.

Indeed, I think people did studies, and they showed that people actually would, as he predicted, use the clock at Grand Central, not as a plurality of choices. As Schelling notes, because he noted this is very fragile, he said this is the point when it comes to actually choosing what we now call a Schelling point, common salience, this is where game theory, rational analysis, logic, leave off, and human whimsy, caprice, even poetry, take over. That is, anything that pops into mind as a possible solution is a solution because it popped into your mind if it’s likely to pop into the other person’s mind.

He suggested maybe the couple will rely on whimsy and look for each other at the lost and found, or maybe something came up in conversation the night before, or maybe there was a guy with a sandwich board advertising a particular coffee shop. When it comes to the points that people converge on because the other one knows that they will converge on it, there doesn’t have to be any rhyme or reason other than human psychology. It pops into your mind.

COWEN: Where would you pick? Ess-A-Bagel? Somewhere, right?

PINKER: I think it would be probably Times Square, maybe the statue of, what’s his name, Sammy Cahn. There’s a grandstand in Times Square with a statue of the Broadway promoter.

COWEN: I don’t think you’re going to meet many people there.

PINKER: Maybe not.

COWEN: Maybe the former World Trade Center site, I thought of. Moynihan Station is not crazy.

PINKER: No. Schelling points this out. It is dependent on the context of your relationship with that person. It would be something that may have come up in conversation, something that you remarked on, a quirk that one of you has that the other one knows about. That’s just it, when it comes to a common knowledge generator or a substitute for common knowledge, really common salience, there is no predictability. There needn’t be any predictability from outsiders other than it’s likely to pop into the mind of the people involved.

On rational disagreement

COWEN: Do you agree with Robert Aumann’s argument, at least as he makes it in the model, that if you meet an epistemic peer and you each announce your views, you ought to end up agreeing with them even without debating the issue or exchanging information about the facts? The mere fact that you have these two opinions, you should be able to converge.

PINKER: Yes. I guess I do. Of course, it’s dependent on the assumptions that go into the theorem. This is the famous theorem that rational agents cannot agree to disagree. More accurately, rational agents who have the same priors, that is they come to a problem with the same understanding of the situation and hence the same degree of credence in a hypothesis before they even look at any evidence, and they share their posteriors, that is, each one tells the other what their subjective estimate of that idea is, without sharing the evidence that they use to come to that assessment, just what the assessment is, then the theorem says those assessments have to be the same.

That is, rational agents with the same priors and whose posteriors or common knowledge cannot agree to disagree. Those assumptions are seldom satisfied, but if they are satisfied, then yes. How does that actually manifest itself in some real-world situation? I don’t know if it was Aumann himself, it may have been, I think, Geanakoplos, who pointed out the corollary of Aumann’s theorem, which sounds totally counterintuitive, even crazy, is that rational agents should not engage in speculative trade. That is, buying a stock in the expectation that you know that its value will rise and you can sell it at a profit.

This is exactly what all the investment analysts tell me. Don’t try to time the market, don’t try to outguess other investors, just buy an index fund and be content to share in the overall appreciation of the market. That’s a corollary of Aumann’s agree to disagree theorem based on the similar intuition, namely since prices are common knowledge, at least in a large public market, if you have insider information that changes the assumption. But since prices are common knowledge, any hint that you have that some company is coming out with an insanely great gadget or there’s going to be a pent-up demand for this or that is almost certainly already reflected in the price. You can’t reasonably expect to make a greater profit by speculating on private information.

COWEN: Am I your epistemic peer when it comes to chocolate? I’ve probably eaten a lot more of it than you have. I’ve had it in more countries. I’ve written a book on food. If I tell you chocolate is good and you shouldn’t dislike it, should you disagree with me? Will you, in fact, just agree with me?

PINKER: I won’t. I’m one of those freaks who just does not enjoy the taste of chocolate.

COWEN: Oh, that’s fine, but you can agree you should enjoy the taste of chocolate.

PINKER: That I ought to.

COWEN: You ought to. That’s right.

PINKER: Yes. I ought to. If I had the same constitutional makeup as you, then perhaps. This is a case where the epistemic peers don’t share priors. I don’t know if you could talk about the prior probability of a preference, but when it comes to just the brute force gut feeling, look, I just don’t like chocolate, that would be a case where it isn’t irrational to not share those priors. That’s the way I’m wired, and I just don’t like it.

COWEN: I don’t like most ballet, but I admit I ought to. I just don’t have the time to learn enough to appreciate it. Take Alfred Hitchcock. I would say North by Northwest, while a fine film, is really considerably below Rear Window and Vertigo. Will you agree with me on that?

PINKER: I don’t agree with you on that.

COWEN: Or you think I’m not your epistemic peer on Hitchcock films?

PINKER: Your preferences are presumably different from beliefs.

COWEN: No. Quality relative to constructed standards of the canon.

PINKER: Oh. That is, the criteria exist external to both of us. We both agree that they’re reasonable. We agree that the criteria are reasonable. We agree on everything that went on in both movies. Then we share our assessments. I put Rear Window at a 0.7, you put it at a 0.4. Is one of us irrational? Maybe.

COWEN: You’re going to budge now, and you’re going to agree that I’m right. We’re not doing too well on this Aumann thing, are we?

PINKER: We aren’t.

COWEN: Because I’m going to insist North by Northwest, again, while a very good movie is clearly below the other two.

PINKER: You’re going to insist, yes.

COWEN: I’m going to insist, and I thought that you might not agree with this, but I’m still convinced that if we had enough time, I could convince you. Hearing that from me, you should accede to the judgment.

PINKER: I should. No, that is right. Oh, okay, I’m open, but crucially, there can be no element of just raw subjective preference. It has to be, does it meet the criteria of, I don’t know, the majority of Rotten Tomatoes or Roger Ebert and Pauline Kael and a panel of their peers? Yes, if that existed, then I’d be open. I’d say, “Well, I don’t agree with Pauline Kael’s criteria, but who am I?”

COWEN: You’re not open on chocolate?

PINKER: Again, if it’s a matter of just, “Do you like the taste?” there, there’s de gustibus non est disputandum. When it comes to just preferences, I don’t think that priors apply or are even meaningful.

COWEN: Say I had that gene where cilantro, to me, tasted like soap. I don’t, but if I did, I would admit, “Well, it’s really wonderful. I just have this disability,” and you have the chocolate disability.

PINKER: Yes. If you reframed it as a descriptive statement about other people’s tastes, then it’s just a question of asking them. If it’s a question of what is better, better means I would choose it. There, it seems to me that priors don’t apply, that it’s not a question of a descriptive belief, but there, you really can have your own tastes.

COWEN: What’s the best Bob Dylan album?

PINKER: Let’s see. I would say Highway 61.

COWEN: I think I’d say Bringing It All Back Home, Blood on the Tracks, and Highway 61 is number three. They’re close.

PINKER: What about Blonde on Blonde?

COWEN: Fourth.

PINKER: Really?

COWEN: It’s a little too long.

PINKER: True.

COWEN: Some of the songs are a little harsh in terms of the sound engineering, I find, but it’s very, very good, right?

PINKER: Yes. Not to mention that in all of the songs, there are some lyrics that are rather misogynistic by today’s contemporary standards, but somehow he gets away with it even retroactively.

On fading liberal enlightenment

COWEN: Is the liberal enlightenment gone forever?

PINKER: I sure hope not.

COWEN: What do you think? Betting odds, it seems to me what we used to think of it is gone forever. We may salvage some parts of it for the new thing that will be, whatever that is.

PINKER: I guess I’ll say not gone forever, but certainly in retreat. Why not gone forever? Because it’s got some built-in advantages. One of them is a reference for science, which is the truth is what refuses to go away when you stop believing in it. Any method that is optimized for finding the truth has just that built-in advantage. It can be repressed. There can be censorship. There can be irrationality.

In general, something that finds the truth, it’s got that going for it, and universalism, namely that if liberal enlightenment depends on universal human rights, the maximization of human interests, the fact that we’re ultimately birds of a feather, we all want to be alive rather than dead, that there is something logically incoherent in saying, “I deserve rights because I’m me and you don’t because you’re not me.” I can impose that by brute force, but to the extent that we all have to agree on something, we’re not going to agree on me being imperial leader.

Now, they aren’t guaranteed to succeed because there is brute force and infirmities in human psychology and cognition, and mob effects and all the rest, nor is liberal enlightenment defenseless. I don’t believe in historical momentum. I think that’s mystical, but there are some things that do look, if not irreversible, to have an awful lot of historical momentum. Like human sacrifice. All the ancient civilizations did it. Now it’s gone. I don’t think it’s coming back.

Why? A combination of we’ve got enough knowledge about cause and effect to know that throwing a virgin into the volcano will not, in fact, bring good weather or make a plague go away. Some degree of empathy for the average person, so that we don’t want to cut the beating heart out of someone’s chest, on average. Likewise, slavery, it exists clandestinely, but we don’t have any legal slavery anymore.

COWEN: In Mauritania, don’t we? Don’t we have more slaves in the world today than we did way back when, by some counts?

PINKER: Not as a proportion of the world’s population.

COWEN: Sure, but maybe it’s the absolute number that matters. If you said to a slave, “Don’t worry, you’re only one-tenth of a thousandth of the world’s population,” it wouldn’t be that much of a consolation.

PINKER: No, but if you said to a randomly selected person, “Almost certainly you’re not going to be a slave,” then that does matter. From behind a veil of ignorance, if an average person has some likelihood of being a slave, and that’s way down, then that matters. In any case, legal slavery, that is enforceable by the state and carried out in open slave markets, no longer exists. Will it come back? I think probably not. There are other things where it’s not inexorable, it’s not monotonic, but it certainly seems to have a direction.

That doesn’t guarantee that it continues because there is no such thing as literal momentum, but it does indicate that as there are greater means of exchanging information, as there are wider and wider pools of people exchanging information and having to agree on common interests, there is some force, some vector. Now, again, I don’t want to say it’s going to win, just that it exists. It’s not that liberal enlightenment, I think, is gone forever. It isn’t even gone. More countries are democratic than not, even though if it’s fewer than it was 20 years ago.

Liberal democracy is on its heels. There is some retreat, but we’re still better off than we were in the ’90s, the ’80s, the ’70s, the ’60s, the ’50s in terms of liberal democracy. I don’t think it’s gone forever.

COWEN: I think my view would be science will do fine, but even the Enlightenment proper did not have universalism to begin with. The future for universalism, to me, looks quite grim. I think we will have a steady stream of low-level conflict across major powers. China does all these cyberattacks on us. They would have been acts of war way back when, and we just put up with it. Russia sabotages EU infrastructure. That might have been an act of war way back when. We just put up with it. We’ll just be creeping up, up, up that threshold, not to nuclear war, but we’ll get as close as we can get without deterrence really having to kick in. The world will just be more violent and have more conflict. Why is that wrong?

PINKER: I think that’s interesting because, as you say, it doesn’t result in war, and war would be much worse. The invasion of Ukraine, rewinding the clock 100 years, that could have been like the assassination of the archduke in Sarajevo. It could have been that NATO decided to go all in, and something like World War I or even worse could have happened. As awful as Ukraine is, and that is awful, it’s not as bad as World War I.

If these cyberattacks, these insults, because of a norm threatened by Putin, no doubt, and by Xi, that you don’t invade your neighbors, you don’t try to rearrange borders by force, we haven’t had another world war. If what you said is correct, things could suck, but still, they could suck much worse if we had a world war. I think it is less likely than it was 100 years ago.

COWEN: If you had to model in as fundamental and abstract a set of terms as possible, not relying on the names of individuals very much, why liberal enlightenment is in retreat today, what does your model look like?

PINKER: I think the most basic answer is that liberal enlightenment is not particularly intuitive. It doesn’t really fit our human nature in the sense of our conception of how our societies ought to be organized. What’s much more natural is tribalism, deference to authority, conformity. I think liberal enlightenment is a great idea. It’s like a lot of counterintuitive discoveries of organized rationality. It is commendable, even if not intuitive. I think there’s a natural backsliding. Unless it’s constantly propped up, people are going to say, “Well, the solution to our problem is we need a strong leader.”

What is a strong leader? Once he’s a leader, he gets to say what’s what. The idea that a leader just temporarily presides and is bound by fiduciary duties laid out with checks and balances and safeguards, it’s a fabulous idea. It’s not particularly intuitive, in the same way that markets are not particularly intuitive, in the same way that science is not particularly intuitive, but they’re still good things. It’s natural to think, why do people have superstition, belief in paranormal, belief in woo-woo? Why do they not grasp certain economic fundamentals, like trade is good, it’s not zero-sum? Why is congestion pricing in Uber rational, even though people hate it?

There are probably a lot of cases where people, just left to their own devices, backslide away from liberal enlightenment principles.

COWEN: Do you think we have stirring liberal thinkers of today who are younger than 55 years old?

PINKER: Not stirring enough. I think we do have such thinkers. Oh, Derek Thompson, Maarten Boudry, Cathy Young. Although I guess I haven’t kept track of her age. She’s younger than 70.

COWEN: Good.

PINKER: Yes, there are young people. I don’t know. Ezra Klein. I’m talking now about journalists. If you go into academia, there would probably be even more. There are other people that I think our audience may not have heard of that are in my email correspondence. Are they stirring enough? Are they like whatever Slavoj Žižek has? Do they have it? Probably not. Although that might be an inherent problem in that influence by charisma goes against the whole idea of liberal enlightenment, where it ought to just be the quality of the content of your ideas, not who looks more dashing in a scarf. Liberal enlightenment is at a disadvantage in that regard, in that it oughtn’t to rely on charismatic thought leaders.

COWEN: How much of a liberal was Napoleon?

PINKER: Not. Wait, no, let me back up there.

COWEN: Wasn’t he quite a liberal? He just loved war too much, and he was an egomaniac, and he wanted to take over other countries, and that killed millions, but wasn’t a lot of his program quite liberal?

PINKER: No, you’re right. There were certain rethinking institutions and laws introducing rational things like the metric system. It could be that his own malignant narcissism subverted the liberal parts of him.

On grading Harvard students

COWEN: Doesn’t everyone at Harvard deserve an A- or an A? They made it into Harvard, right? Same with Stanford.

PINKER: Yes. I actually have data on this because that’s what our dean told us, not with a whole lot of conviction, when we had a meeting about grade inflation. She said, “Well, we like to think of it as grade compression. Maybe students are just getting better and better and better.” I just know that that’s not true because I have records from performance on my multiple-choice test in my intro psych course, which I’ve taught for more than 20 years. It’s pretty constant year to year, there’s no subjective sliding grade, and the performance has gone down.

Students are not doing as well as they used to. I’ve been forced, just so as not to drive away all my students, to inflate my grades. I give way more A’s than I used to, just so I don’t have an empty lecture hall. It’s not because the students are getting better and better. We know that a lot of Harvard students are admitted on nonmeritocratic criteria. Forget about the whole hot button of racial preferences, but legacies, athletics, social action, they sorted clothes for the homeless, they circulated a petition, they hung out with artists in an artist’s colony. Harvard does not admit students on meritocratic grounds.

COWEN: If you’re the boss, what would the median grade at Harvard be?

PINKER: Median grade?

COWEN: Median grade, yes.

PINKER: Let’s see. B-.

COWEN: How many students at Harvard, what percent do you think essentially deserve A- or A? They’re just very smart, they work hard, they’re great, maybe Harvard classes are harder, but they just ought to have good grades because they’re wonderful?

PINKER: A quarter, 25%?

COWEN: Only a quarter? I’d be shocked if it’s only a quarter. Your admissions department must be terrible then.

PINKER: It is terrible. I’ve argued that as much. They use holistic criteria, which is mumbo-jumbo. It’s mysticism. The reason I say that is I think I’m a pretty good lecturer. Everyone thinks they’re a good lecturer.

COWEN: No, I don’t doubt that. I’m sure you are.

PINKER: My lecture hall is two weeks into the semester or three weeks into the semester, it’s half empty. Anyone whose fanny is not in that seat, I think, does not deserve an A because it’s not asking a lot. It’s less than three hours a week, and they get to hear me. I don’t think I have a big ego, but people in the real world actually pay money for tickets to my events. For Harvard students, it’s already paid. They still stay away because they have more important things to do, like pulling an oar or singing in an acapella group. Do they deserve an A? I frankly think they don’t.

My lecture hall is two weeks into the semester or three weeks into the semester, it’s half empty. Anyone whose fanny is not in that seat, I think, does not deserve an A because it’s not asking a lot. It’s less than three hours a week, and they get to hear me. I don’t think I have a big ego, but people in the real world actually pay money for tickets to my events. For Harvard students, it’s already paid. They still stay away because they have more important things to do, like pulling an oar or singing in an acapella group. Do they deserve an A? I frankly think they don’t.

On linguistics vs LLMs

COWEN: Are you discouraged that linguistics has had so little impact on current large language models?

PINKER: Yes, I am. I don’t know whether it’s that linguists themselves have not made themselves useful enough, have not been part of the conversation, or the cynical view is linguistics has just been on the wrong track and it has nothing to contribute. I tend to think more of the former. There are some features in large language models that actually implicitly build in some of the insight of linguistics. The fact that you’ve got all of these hidden layers is an acknowledgment that the regularities in language and the regularities in thought that language expresses are abstractions.

They’re not triggered by individual words, but they’re patterns and patterns and patterns, groupings, and larger and larger groupings. The fact that one of the breakthroughs in this generation of large language models is what I call attention heads, which are basically processors that look at temporally separated words in search of regularities, as opposed to adjacent ones. The fact that they’re not trained on the words themselves, but on tokens that correspond to more or less to morphemes.

The fact that they’re trained on embeddings, which is not, again, not the words themselves, but some backhanded indirect representation of meaning. In that case, the context that a word typically occurs in. There are a lot of discoveries of linguistics that are implicitly built in, but linguistic theory in the sense of what you see in a journal like Linguistic Inquiry, don’t have direct counterparts. Maybe they wouldn’t need all of those gigawatts and all of those GPUs if they were able to cut to the chase quicker by building in linguistic insights. My hunch is yes, but the models are so complex that I can’t say for sure.

COWEN: Does it favor Chomsky? Does it favor the view that language is simpler than we thought? Where does it push your intuitions?

PINKER: It has changed my intuitions that these models are capable of more than I would have guessed that they’re capable of, given how they’re built, but that may show that my intuitions break down when it comes to billions, tens of billions, hundreds of billions of parameters, 90 hidden layers, training that would be the equivalent of 30,000 years if a child was listening to it. My intuitions may be useless of what you can accomplish at those scales, which leads to an ignorance of all of us so far until we probe the models to find out what’s in the black box and what’s responsible for their intelligence.

Does it mean that the linguists and cognitive psychologists were wrong all along in saying that you need rules, algorithms, propositions, mental models? That it’s all just massive statistical patterns, or is the discrepancy between the fact you have to train these models with the equivalent of 30,000 years and a human child can speak in two years, say that it’s almost a coincidence that they’re as capable as they are? They’re capitalizing on all this input that we humans have put out there, and that they’re doing something human-like in a very different manner than the human brain.

I tend to think the latter, and the fact that they are so vulnerable to confabulation, hallucination, blends, as my former student, Gary Marcus, insistently points out, is a hint that they’re doing things in a different way than the human brain is doing things.

COWEN: How long do you think it will be before they can write a poem, say, as good as the median poem by Pablo Neruda?

PINKER: The thing is that the fact that Pablo Neruda didn’t write it itself goes into our assessment of how good a poem it is.

COWEN: A blind taste test, right?

PINKER: Yes, a blind taste test. I don’t know. They’re already better at doing things, probably like composing limericks or higgledy-piggledies on the fly. They’re not great literary quality, but they are often very stringent formal criteria. They’re remarkably good at that. Again, going back to the North by Northwest versus Rear Window, what goes into assessment of quality, and to what extent is there such a thing as objective quality of works of art?

I tend to think that our assessment of a work of art isn’t completely extricable from our knowledge of its provenance, which is why a fake da Vinci, an imitation da Vinci, is just not the same as a real da Vinci in terms of the experience, in terms of any judgment. How it’s produced goes into our assessment of what kind of work of art it is.

COWEN: Before my last two questions, let me just plug your book again. It is called When Everyone Knows That Everyone Knows . . . The dots are important. The subtitle is Common Knowledge and the Mysteries of Power, Money, and Everyday Life. A fascinating read. Last two questions. First, what’s your favorite song on Rubber Soul? Mine is “You Won’t See Me.” I’ll announce in advance, but I know it won’t change what you say.

PINKER: I really like that song. I think “I’ve Just Seen a Face.”

COWEN: It’s a good pick. I like the bluegrass rhythms behind it all.

PINKER: Yes, and just the insistent rhythm and the piling up of line on line.

COWEN: Final question. What will you do next?

PINKER: Oh, one possibility is to revisit all the indicators of progress. What’s the report card on the world 10 years after I published Enlightenment Now? What progress has continued? Where have we backtracked? In a sense, answer your question, what are the prospects for liberal enlightenment?

COWEN: Steven Pinker, thank you very much.

PINKER: Thanks so much, Tyler. Always a pleasure.

This episode was made possible through the support of the John Templeton Foundation.