synthesis

Findings

What the field looks like once you stop reading parties one at a time and start reading the whole map. Every figure here is computed live from the dataset, so it moves as the dataset moves.

provisional · v0.2 — hand-coded, computed live; read the caveat below

0.97

the correlation tying Liberal democracy to Civil liberties — the tightest bond in the field. The strongest correlations cluster into two groups, around the Islamist–secular and regime-stance dimensions.

45%

of the 98 parties on the map are banned or hold no seats. Across much of the region the most consequential actors are the ones the state is trying to shut out — so the dataset keeps them in.

97%

of the cases where a party's rhetoric and record diverge, the rhetoric runs to the party's stated advantage. Manifestos overstate the governing record more often than they understate it.

The cross-axis structure

Of the 120 unique axis pairs, the 5 strongest correlations cluster into two groups: an Islamist–secular cluster (religion-in-state ↔ Palestinian-question ↔ regime-stance) and a regime-stance ↔ democratic-norms cluster. The most independent pairs run orthogonal to these — economic / federalism / regional-stance carry distinct explanatory power. Pairs are limited to those sharing at least 30 scored entities, so a thin scoped axis can't manufacture a headline bond.

Strongest correlations

Axis AAxis Brn
Liberal democracy Civil liberties +0.97 81
West alignment Regional stance +0.96 81
Social State & religion +0.95 83
State & religion Traditionalism vs modernization +0.91 78
Social Traditionalism vs modernization +0.91 78

Most independent

These pairs do distinct explanatory work — knowing a party's score on one tells you little about its score on the other.

Axis AAxis B|r|
State & religion Regime stance 0.01
Economic Centralism vs federalism 0.03
Social Regime stance 0.04
Economic Liberal democracy 0.04
Pan-Arab vs particularist Traditionalism vs modernization 0.05

Where parties cluster vs. where they spread

The standard deviation of scores on each axis reveals where MENA parties genuinely disagree (high σ) and where they reach near-consensus (low σ). Compass-axis cleavages run wide; some issue axes reach near-consensus across the region. These rankings run over the 12 core axes — the ones scored on at least 30 entities, enough to read a field-level pattern from. The 4 sparser scoped axes are held out into their own table below rather than averaged in: with a handful of points one outlier swings σ and "most contested" stops meaning contested. Why.

Widest spread (most contested)

Field spread (σ across parties) with the panel's mean within-cell disagreement beside it. An axis that is wide and rater-contested is unsettled on two counts; one that is wide but tightly rated is a real, agreed cleavage.

AxisσMeanRangePanel σ
State & religion 6.47 +0.12 19.0 1.62
Regime stance 6.14 +2.11 19.0 2.42
West alignment 5.62 +0.58 18.5 1.65

Tightest spread (consensus)

AxisσMeanRange
Pan-Arab vs particularist 4.20 +0.84 18.8
Centralism vs federalism 4.46 -3.56 18.5
Economic 4.54 -0.67 17.0

Most-skewed mean (regional consensus toward one pole)

AxisMeanDirection
Palestinian question +5.73 Pro-Palestinian rights
Centralism vs federalism -3.56 centralist
Regime stance +2.11 pro-regime

Within the core, Pan-Arab vs particularist, Regime stance are well-covered but rater-contested (panel σ above 2.3): enough entities to rank, but the panel splits on the cell scores, so the composite there is an estimate, not a fixed point.

Peripheral axes — reported, not aggregated

These scoped axes are scored on fewer than 30 entities — too few to anchor a field-level claim — so they sit out of the rankings above and the headline correlations. They're shown in full, panel disagreement included, because the sparsity is the point: it marks where coverage is thin, not where the region agrees.

AxisnσPanel σMean
Sectarian power-sharing 11 3.11 3.53 -4.45
Iran posture 19 7.67 1.35 -1.63
Press freedom 21 5.55 1.55 -2.45
Gender equality 22 5.97 1.28 -2.95

Distributions across the 16 axes

Each row shows the histogram of party scores binned at integer intervals on a fixed [-10, +10] scale. Blue bars = negative scores (left pole), red bars = positive (right pole), sage = zero (centre). The shape of each distribution makes the spread / consensus / skew patterns visible at a glance.

Structural status of MENA party politics

The government_role distribution captures something compass scores can't: the operational political space MENA parties live in. 44 of the 98 parties on file (45%) are banned or extra-parliamentary. Only 10 are lead parties.

  • 10 lead parties — the dominant force in their country's government
  • 16 in coalitions (major / minor / confidence-supply)
  • 28 in opposition
  • 39 extra-parliamentary — legal but no seats
  • 5 banned / outlawed

Banned, dissolved, restricted

5 outlawed parties currently on file (1 in IQ; 1 in DZ; 2 in IL; 1 in SY); 8 dissolved; 0 restricted. Inclusion of outlawed entities is deliberate — for many MENA contexts, the most consequential political actors are also the ones the state is trying to suppress (Northern Islamic Movement in IL, FIS in DZ, FJP in EG, Al-Wefaq in BH).

Where parties concentrate

The top 5 countries by party count carry the bulk of the dataset's analytical weight. Coverage in the others is editorial: a chosen anchor set, not a comprehensive parliamentary roster.

  • 27 parties — IL flag IL
  • 9 parties — EG flag EG
  • 7 parties — TN flag TN
  • 7 parties — PS flag PS
  • 6 parties — LB flag LB

Per-axis poles

The party at each end of every axis — the lowest (most negative) score and the highest (most positive). Useful for calibration: if either pole party feels meaningfully wrong at its stated extreme, the axis rubric or the scoring deserves another look. Note that on some axes both poles sit on the same side of zero (e.g. democracy, where even the most pro-democracy party scores well below the most anti-democracy one).

AxisMost negativeMost positive
Economic Tunisian Workers Party (-9.0) Free Egyptians Party (+8.0)
Social Noam (-9.0) The Democrats (+8.0)
State & religion Islamic Salvation Front (-10.0) Tunisian Workers Party (+9.0)
Liberal democracy Ansar Allah (Houthis) (-9.0) The Democrats (+8.5)
West alignment Ansar Allah (Houthis) (-10.0) National Unity (+8.5)
Regional stance Ansar Allah (Houthis) (-10.0) National Unity (+7.0)
Palestinian question Otzma Yehudit (-10.0) Balad (+10.0)
Civil liberties Ba'ath Party (Syria) (-9.0) Hadash (+9.0)
Regime stance Syrian National Coalition (-9.0) Ba'ath Party (Syria) (+10.0)
Pan-Arab vs particularist Kurdistan Democratic Party (-9.0) Ba'ath Party (Syria) (+9.8)
Centralism vs federalism Ba'ath Party (Syria) (-9.0) Kurdistan Democratic Party (+9.5)
Traditionalism vs modernization Noam (-9.0) The Democrats (+7.5)
Gender equality Islamic Salvation Front (-9.0) The Democrats (+8.0)
Iran posture Likud (-10.0) Hezbollah (+10.0)
Press freedom Ansar Allah (Houthis) (-9.0) The Democrats (+8.0)
Sectarian power-sharing Amal Movement (-7.0) Sairoon Alliance (+4.0)

Declared vs. behavioral — where rhetoric and record split

For 20 parties across 8 axes, the dataset carries both a declared score (manifesto-stated position) and a behavioral score (voting record / governance behaviour). The gap between them is the measure of how much a party's rhetoric overstates or understates its actual record. Across all 36 pair rows, the mean absolute spread is 6.08 on the −10 to +10 scale (max 12), and 35 of 36 point one direction: declared is more positive than behavioral — the rhetoric overstates the record more often than understates it.

Largest divergences

Top 8 (party, axis) pairs by absolute gap. Negative spread (red) means the party's record runs to the left of its rhetoric on that axis; positive (green) means the record runs further right.

PartyAxisDeclaredBehavioralΔ
LB flag Hezbollah Sectarian power-sharing +3.0 -9.0 +12.0
LB flag Free Patriotic Movement Sectarian power-sharing +2.0 -9.0 +11.0
PS flag Hamas Liberal democracy +4.0 -7.0 +11.0
YE flag Ansar Allah (Houthis) Liberal democracy +3.0 -8.0 +11.0
EG flag Mostaqbal Watan Liberal democracy +3.0 -7.0 +10.0
DZ flag National Liberation Front Liberal democracy +5.0 -5.0 +10.0
LB flag Hezbollah Liberal democracy +3.0 -5.0 +8.0
EG flag Freedom and Justice Party Liberal democracy +6.0 -2.0 +8.0

Toggle the lens chips at the top of the compass to see how it moves between declared and behavioral views for these parties.

Inter-rater reliability — the reproducibility pilot

The panel has layers that are easy to conflate, so to be explicit: author-baseline is the hand-coded prior; r1 (Gemini, Grok, no definitions) is a locked "before", kept only to measure whether the rubric changed the scores, and excluded from the published median; r2 (nine models — Claude, Gemini, GPT, Gemma, Grok, Kimi, DeepSeek, Qwen, MiniMax — with the definitions and the declared/behavioral rule) is the live multi-model panel, landing country by country and shown on /ratings; a grounded pilot (one model scoring from cited documents) seeds the document-verification round. The κ below is an earlier, separate coder pilot. The formal multi-model κ over r2 lands once coverage is broad enough to be representative — a two-country panel would mislead.

The dataset's hand-coded composite scores are the Pilot Phase baseline. To check whether those scores are reproducible from the published rubrics rather than idiosyncratic to one coder, a stratified sample of (party, axis) cells was re-coded across 4 rounds by 2 independent second-pass coders, blind to the baseline scores. Quadratic-weighted Cohen's κ and Pearson r measure agreement; mean |Δ| is the average gap on the −10..+10 scale.

This is an internal reproducibility check, not external expert validation — both coders share the project's rubric framework, so the absolute agreement runs high and should be read as a floor, not proof. The signal worth acting on is the relative pattern across axes. External multi-coder validation by independent domain experts is a v0.3 milestone.

Headline: 102 second-pass ratings over 51 cells · 7 axes · overall κ 0.90 , Pearson r 0.91 , mean |Δ| 1.77 points.

Pairwise agreement

Each second-pass coder against the baseline, plus the two coders against each other.

Pairnκ (quad.)Pearson rmean |Δ|
Canonical ↔ coder-b 51 0.88 0.91 2.12
Canonical ↔ coder-c 51 0.92 0.93 1.43
coder-b ↔ coder-c 63 0.95 0.97 1.48

Agreement by axis — hardest first

Baseline vs. pooled second-pass ratings. The governance axes carry the most coder disagreement.

Axisnmean |Δ|Pearson rκ (quad.)
Regional stance 14 3.75 0.61 0.61
Liberal democracy 14 2.36 0.92 0.83
West alignment 16 1.81 0.97 0.91
Economic 16 1.56 0.91 0.89
Social 14 1.25 0.97 0.95
State & religion 14 1.14 0.98 0.97
Palestinian question 14 0.57 0.99 0.99

What the pilot found

The conflict-defining axes — Palestinian question and West alignment — reproduce almost perfectly (mean |Δ| under half a point). These are the axes where a party's position is loudly and repeatedly stated, so two coders reading the same rubric land in the same place.

The governance axes — democracy and economic — carry roughly 2–3× the disagreement (mean |Δ| ≈ 1.2 vs. ≈ 0.4). Coding "commitment to liberal democracy" for an Islamist-electoral party, or "market vs. statist" for a movement whose economics are subordinated to ideology, demands contextual judgment the rubric doesn't yet fully pin down. That is precisely the kind of weakness a pilot exists to surface.

Action: the democracy and economic rubrics get an anchor- tightening pass — more worked MENA exemplars at each scale point — before the dataset scales past the pilot countries. The conflict axes are validated as-is.

What's next

These are the patterns the current data shows. The next methodological step shifts how scores get assigned — from author-hand-coded against secondary literature to derived from reading party platforms, speeches, and voting records, with cited passages. The interesting result, when that lands: where the two methods agree on a party's position, and where they pull apart. More on methodology.

Inter-method agreement

For every (party, axis) where both a hand-coded composite score and a document-grounded declared/behavioral score exist, the pair tells us whether the two methods agree. Mean absolute difference (lower is better) and Pearson r (higher is better) per axis.

Overall: 19 parties have both a hand-coded and a document-grounded score on at least one axis. Mean absolute difference across all paired (party × axis) cells: 7.05 (on the −10 to +10 scale).

Axisn pairsmean |Δ|r
Liberal democracy 11 9.77 +0.93
Civil liberties 8 7.44 +0.36
Gender equality 5 2.40 +0.54
State & religion 4 3.38 -0.29
Sectarian power-sharing 3 8.33 +0.19
Traditionalism vs modernization 1 5.00
Pan-Arab vs particularist 1 10.00

Mean |Δ| under 2 means the two methods are within one rubric anchor of each other — functionally agreeing. r above 0.6 means they rank parties in similar order on the axis even when absolute scores differ. Axes where both numbers are weak are the interesting cases: the rubric language probably needs work, or one of the methods is missing context the other has.

How to cite

Each record carries a retrieval date because the dataset is live — individual entries update as verification deepens. Use the per-record citation when referencing this specific profile; use the dataset citation below when referencing the project as a whole.

In-text: (Gara, 2026)

APA 7Reference list · academic default
Gara, T. (2026). Findings — MENA political-position dataset synthesis [Dataset entry]. Tayyar: A MENA political-position dataset. Retrieved June 21, 2026, from https://tarekgara.com/tayyar/findings
Chicago author-dateCommon in political-science journals
Gara, Tarek. 2026. "Findings — MENA political-position dataset synthesis." Dataset entry, Tayyar: A MENA political-position dataset. Accessed June 21, 2026. https://tarekgara.com/tayyar/findings.
BibTeXFor LaTeX / Zotero / reference managers
@misc{tayyar-findings-page,
  title     = {{Findings — MENA political-position dataset synthesis}},
  author    = {Gara, Tarek},
  year      = {2026},
  publisher = {Tayyar: A MENA political-position dataset},
  type      = {Dataset entry},
  url       = {https://tarekgara.com/tayyar/findings},
  urldate   = {2026-06-21},
  note      = {First-pass entry; second-pass external review planned before publication.}
}

If you're citing Tayyar as a project rather than this individual record.

APA 7Preprint
Gara, T. (2026). Tayyar: A MENA political-position dataset [Preprint]. Retrieved June 21, 2026, from https://tarekgara.com/tayyar/paper
BibTeXPreprint
@unpublished{tayyar-preprint,
  title  = {{Tayyar: A MENA political-position dataset}},
  author = {Gara, Tarek},
  year   = {2026},
  type   = {Preprint},
  url    = {https://tarekgara.com/tayyar/paper},
  urldate = {2026-06-21},
  note   = {Living document, regenerated from the live dataset on page load.}
}

First-pass entry; second-pass external review planned before publication.