essay · 20 June 2026 · 14 min read

I asked nine AIs to map Middle Eastern politics

Western left-right doesn't fit the region. So I built Tayyar — a panel of language models that scores MENA parties on sixteen axes — then used it to dig into Israel's fault lines and a live Knesset coalition simulator. The data says the Middle East barely splits on the economy.

By Tarek Gara

The political compass you know — left to right, authoritarian to libertarian — came out of a very specific argument. “Left” and “right” are just where deputies happened to sit in a French assembly in 1789. The economic version we use today is a story about the state and the market: how much should the government own, tax, redistribute. It’s a good map for the place and the century that drew it.

It travels badly.

Drop that map on the Middle East and most of the interesting arguments fall straight through the cracks. The loudest fights in the region aren’t about marginal tax rates. They’re about religion and the state. About the regime — are you with it or against it. About which way a country faces: Tehran, Washington, Riyadh, no one. A party that reads as “moderate” in one country is the regime’s party; the same word across the border belongs to the people the regime put in prison. “The left” in Cairo, Beirut, and Baghdad is three different animals wearing one label.

I kept wanting a map that took the region on its own terms, and I couldn’t find one I trusted. So I built it. It’s called Tayyar — تيّار, Arabic for “current,” as in a political current — and it lives over here.

Small confession up front: I read these platforms in the languages they’re written in. Arabic is my first language and Hebrew is around me every day, so the whole thing renders right-to-left and keeps the original text sitting next to the translation. That mattered to me more than it probably should have.

The awkward part: who decides where a party sits?

Here’s the problem with any political map. Someone has to place the dots. And the instant a person places them, you are looking at that person’s politics at least as much as the party’s. One coder is one worldview. A roomful of coders is slow, expensive, and usually still shares a worldview.

So Tayyar does something that sounds, I’ll be honest, slightly unhinged the first time you say it out loud: it asks a panel of large language models to score the parties.

Not one model — a panel. Nine of them, five built in the United States and four in China. Each one reads a party’s documented positions, against a fixed rubric, and places it on every axis from −10 to +10, with the party’s name kept out of the way wherever possible so it’s scoring the substance, not the brand. The published number is the median of the panel — the nine models, plus a hand-coded baseline of my own tossed into the same pot as one more anchor.

I know how that lands. Asking chatbots to grade political parties is not self-evidently a serious thing to do. Two reasons I think it holds up anyway:

First, it shows its work. Every single score opens up to reveal the reasoning behind it and the exact passages the model leaned on. Click any cell and you can see why. It’s reproducible — run it again, get the distribution again. A lone human expert can’t hand you their priors as a file you can audit.

Second, and this is the part I didn’t expect to enjoy: when the models disagree, that isn’t noise to be averaged away. It’s a measurement. More on that in a minute.

None of this makes the numbers true. They are estimates of documented positions, not verdicts on anyone, and emphatically not endorsements — the entire thing is built to describe what a party says and does, never whether it’s good. The panel agrees with itself at around 0.86 on the standard reliability scales: solid, not scripture. This is a v0.2. Treat it like a draft that argues back.

What the data actually says

As I write this, Tayyar holds 98 parties (83 of them placed on the full axis system), 274 politicians, across 20 countries and 16 axes, grounded in 286 source documents. Here’s the part I find genuinely interesting.

Chart

Where the region leans, and how split it is

Each bar spans one standard deviation around the mean party position on that −10 to +10 axis; the dot is the mean. A wide bar means the region is genuinely split; a narrow bar means broad agreement. Highlighted: the most divided axis (State & religion) and the tightest (Economic). Tayyar v0.2 · 83 parties

Look at where the bars are widest. The region splits hardest on religion and the state, on the regime, and on alignment with the West. Those are the axes where parties are flung clear across the scale — a real argument, with populated opposite ends.

Now look at the bottom of the chart. The economy — statist versus market, the exact thing a Western compass treats as the axis — is the tightest well-covered axis in the whole dataset. Parties bunch up near a mild-statist center and mostly don’t fight about it. The question the imported map makes the main event is, here, close to settled.

The region's axes, ranked from most contested to least. 'Spread' is the standard deviation of party positions on the −10 to +10 scale — bigger means more split.
Axis	The two poles	Where the region leans	Spread
State & religion	religious state ↔ secular	split down the middle	6.4
Regime stance	anti-regime ↔ pro-regime	slightly pro-regime	6.2
West alignment	anti-Western ↔ pro-Western	evenly split	5.7
Gender	patriarchal ↔ equality	leans patriarchal	5.4
Palestinian question	opposed ↔ pro-Palestinian rights	strongly pro-Palestinian rights	5.0
Economic	statist ↔ market	mild-statist center	4.6
Centralism vs federalism	centralist ↔ federalist	clearly centralist	4.5

That one fact is the whole reason a borrowed left-right line flattens the region. It spends its entire horizontal resolution on the question everyone roughly agrees about, and crushes the three or four questions they’d actually come to blows over into a vague “social” smudge.

A couple of other things fall out of the numbers:

The closest thing to a regional consensus is the Palestinian question. It’s the most lopsided axis in the dataset — parties lean hard toward pro-Palestinian-rights positions, more one-sidedly than on any other single issue.
There’s a quiet, broad lean toward the centralist state (against federalism), and a more sobering one toward patriarchal traditionalism on gender. Those are consensus directions, not fault lines. The region argues loudly about God and the regime; it largely agrees on the strong central state.

Now zoom into one country

Averages across twenty countries hide as much as they show. So let me do the thing the tool is actually for, and zoom into one place. Israel — which carries 27 parties in the dataset, 19 of them placed — is the most internally divided country in the whole set, and it divides in a way that turns out to be very revealing.

Start with the headline regional finding: the Palestinian question is the region’s most one-sided axis. Parties across twenty countries lean hard, together, toward pro-Palestinian-rights positions. Now look at the same axis inside Israel alone:

Chart

The Palestinian question: the region, then Israel

The same axis, two populations. Across the region, parties cluster toward one pole — the most lopsided axis in the dataset. Inside Israel, the very same axis is the single most divided one: parties span the entire scale, centered near zero. Israel is the region's outlier on the region's defining question. Tayyar v0.2

That flip is the whole point of looking country by country. The question that unites the region is the question that splits Israel down the middle, all the way to both poles. And it isn’t alone — Israel’s two deepest internal divides are exactly the two you’d guess if you’ve followed its politics for five minutes: the security cleavage (the Palestinian question, a standard deviation of 7.8, parties stretched from −10 to +10) and the religious cleavage (state and religion, 7.1, sitting dead-center because the secular and the devout are both fully represented and pulling in opposite directions).

Israel's internal spread on each axis, beside the regional spread. Israel splits harder than the region on almost everything — except the economy, where, like everywhere, it mostly doesn't.
Axis	Israel	Region	The cleavage
Palestinian question	7.8	5.0	the security divide — its deepest, and the region’s most lopsided
State & religion	7.1	6.4	secular vs religious, split dead-center
Regime stance	6.8	6.2	the system vs those who would remake it
Gender	6.7	5.4	liberal vs traditional, fully contested
Economic	4.4	4.6	as everywhere, not the fight
Centralism vs federalism	3.7	4.5	a unitary state, near-consensus

And there, again, at the bottom: the economy. Even in Israel — even with a famously combative politics — statist versus market is the calmest well-covered axis, tighter than the region’s. Israeli elections are not fought over the size of the state. They’re fought over religion and security. Which is exactly why forming an Israeli government is the hardest coalition puzzle in the democratic world, and exactly why I couldn’t resist building a machine for it.

What you can build on a map like this

Once you have every party placed on sixteen axes, you can do more than draw a map. You can simulate the thing the map is really about: power.

So there’s a second tool, a Knesset election simulator. It looks nothing like Tayyar — it’s a cool-blue election-night situation room rather than a warm research notebook — but underneath, it runs on Tayyar’s numbers.

The mechanics are the real thing. You slide each list’s share of the vote; anything under the 3.25% threshold wins nothing; the rest go through D’Hondt into a 120-seat hemicycle, and a meter tracks the 61 seats you need to govern. That part is just electoral arithmetic, and it’s exact — feed it the real 2022 vote and it reproduces the real Knesset, down to the surplus-vote pacts.

The part that uses Tayyar is the coalition. When you assemble a government, the simulator doesn’t just ask do they have 61? It asks do these parties actually agree on anything? — by reading their positions straight out of the dataset and scoring the coherence of the group. And here the data earns its keep. A coalition of the religious right (its lists tightly bunched on state-and-religion and on the Palestinian question) lights up as coherent. The 2021 “change” government — secular centrists plus the secular hard-right plus an Islamist Arab party, a coalition that spanned nearly the entire 7-point spread on both of Israel’s deepest axes — lights up as exactly what it was: a government that agreed on one thing (who shouldn’t be prime minister) and nothing else. The arithmetic says 61. The positions say good luck. That gap is the whole story of the last five Israeli elections, and you can watch it on a coherence meter.

Then there’s the forecast. Hit “Run 6,000 elections” and it runs a Monte-Carlo: each simulated election perturbs the parties along three latent factors — a national left-right swing, a religious-secular swing, and Arab turnout — and the amount each list moves with each factor is loaded from its Tayyar position. So ideologically close lists rise and fall together, the way real polling moves, instead of drifting independently. It isn’t a vibe. It’s calibrated on 63 final-week polls from the 2019–2022 elections: tuned so the model’s per-party error matches the real one (about 1.66 seats), with the measured polling biases (Haredi and Arab lists tend to be under-counted) corrected behind a toggle you can switch off. In backtests, the actual result lands inside the model’s 80% band 85% of the time.

My favorite feature is the one that tries to embarrass it. You can rewind to a past election and ask could the model have called it? — feed it only that race’s final-week polls and let it run blind. It calls the 2022 right-religious sweep comfortably. It’s honestly, visibly wrong about the 2019 deadlock — it over-counts a centrist list toward the right-wing bloc that, in reality, refused to sit with them — and it shows you the miss rather than hiding it. A model that only ever agrees with the past is just memorizing it.

A detail I love, because it captures how knife-edge this all is: in 2022, Meretz missed the 3.25% threshold by 0.09 points and Balad by 0.34. Two left and Arab lists, a third of a percent short, and their seats evaporated and flowed to the bloc they oppose. Nudge either of them over the line in the simulator and the whole map changes color. That is not a rounding error. That is how the most right-wing government in Israel’s history got its majority.

None of this is a prediction. It’s an illustrative model, the same way Tayyar is an illustrative map — built to make the arithmetic and the cleavages tangible, so you can put your hands on them, not to tell you who wins.

The models give themselves away

Back to the disagreement, because this is my favorite thing the project turned up.

Take one party. Ask a model built in San Francisco and a model built in Hangzhou to score it. On the declared axes — what the party literally says about itself — they agree closely. On the interpretive axes — how committed it really is to democracy, what its posture toward the regime amounts to, what its identity politics mean — they pull apart. And the split isn’t random. It tracks where the model was built.

In other words, the rater’s own priors leak into the scores, and you can see the seam. Most political datasets bury that: one lab, one method, the priors baked in and invisible. Tayyar runs a whole little lab on it — US-built versus China-built, agreeing on the text and diverging on the judgment. It’s the one thing this dataset has that almost nothing else does, and it only exists because I refused to use a single rater.

Go poke at it

The genuine joy of building something like this is that you can just play with it:

The compass — swap the axes, filter by family or country, watch the whole field rearrange.
The map — the region as a choropleth, shaded by whichever axis you pick.
The Knesset simulator — slide the vote, build a coalition, run six thousand elections, rewind history.
A handful of games: guess who said it, or call a head-to-head before the panel does.
Or paste your own text — a speech, a manifesto, a tweet, in Arabic, Hebrew, or English — into the analyzer and watch it get scored live, with the lines it underlined.

It’s a pilot, and it’s wrong in places. I would much rather it be wrong in public, where you can tell it where it’s wrong, than confident in private.

So here’s my actual question. Somewhere in there, a language model has misjudged a party you know better than it does. Which one — and which way did it get it wrong?

Want the methodology rather than the tour? The panel, the reliability numbers, and the rater-provenance finding are written up in a short methods paper, now on arXiv (arXiv:2606.23042) — or read the same thing on-site.

§The awkward part: who decides where a party sits?

§What the data actually says

§Now zoom into one country

§What you can build on a map like this

§The models give themselves away

§Go poke at it

Comments