Arabic Leaderboards has launched a new platform to centralize evaluations of Arabic AI models, featuring updates to the AraGen benchmark and the introduction of the Arabic Instruction Following leaderboard. The AraGen-03-25 release includes expanded datasets and improvements in evaluation methodologies, emphasizing the need for accurate assessments in Arabic language tasks. Ongoing analysis of ranking consistency among models highlights the robust nature of the evaluation framework amidst dynamic updates.