essay · 22 February 2026 · 7 min read
Why AI thinks in English and forgets in Arabic
On tokenization, context windows, and what gets lost when Semitic languages are treated as bloated data.