
Fine-Tuning an SLM for a Low-Resource Language

Hugging Face Forums [Unofficial] June 6, 2026

Oh. I don’t know Persian myself, but Gemma is basically like Gemini’s younger sibling, so my guess is that it may be fairly strong for Persian too.

Also, leaderboards can make the search much easier:

* Open Persian LLM Leaderboard (OPLL), a Hugging Face Space by opll-org
* Another Open Persian LLM Leaderboard mirror / variant, a Hugging Face Space by PartAI
* MIZAN: A Persian LLM Leaderboard, a Hugging Face Space by MCINext
* ParsBench (GitHub: ParsBench/ParsBench), which provides toolkits for benchmarking LLMs on Persian-language tasks
* FaMTEB, if embeddings / RAG are relevant: "FaMTEB: Massive Text Embedding Benchmark in Persian Language" (arXiv:2502.11571)
* PersianPhi (amirakhlaghiqqq/PersianPhi on Hugging Face) may also be worth checking as a Persian-adapted compact model

So, if it were me, I’d probably use those leaderboards to make a shortlist, then compare tokenization and run a small private Persian eval before choosing the base model.
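For the tokenization comparison, something like the following rough sketch might be a starting point. It measures average tokens per whitespace-separated word on a handful of Persian sentences, on the assumption that a tokenizer which splits Persian into fewer pieces leaves more usable context and tends to be cheaper to fine-tune. The helper names `tokens_per_word` and `compare_tokenizers` are made up for illustration, and whitespace words are only a crude proxy, so treat the numbers as a shortlist filter rather than a verdict.

```python
from typing import Callable, Iterable, List


def tokens_per_word(tokenize: Callable[[str], List[str]], texts: Iterable[str]) -> float:
    """Average number of tokens per whitespace-separated word.

    `tokenize` is any callable mapping a string to a list of tokens.
    Lower values usually mean the tokenizer fragments the language less.
    """
    total_tokens = 0
    total_words = 0
    for text in texts:
        total_tokens += len(tokenize(text))
        total_words += len(text.split())
    if total_words == 0:
        raise ValueError("no words in sample texts")
    return total_tokens / total_words


def compare_tokenizers(model_names, texts):
    """Hypothetical helper: rank candidate models by tokens per word.

    Loads each tokenizer from the Hugging Face Hub, so it needs network
    access and the `transformers` package (imported lazily here).
    """
    from transformers import AutoTokenizer

    texts = list(texts)
    scores = {}
    for name in model_names:
        tok = AutoTokenizer.from_pretrained(name)
        scores[name] = tokens_per_word(tok.tokenize, texts)
    # Most compact tokenization first.
    return sorted(scores.items(), key=lambda kv: kv[1])
```

To use it, pass the model IDs from your own shortlist and a few hundred sentences that look like your target data, e.g. `compare_tokenizers([candidate_a, candidate_b], persian_sentences)`, where all three names are placeholders you would fill in yourself.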
