CULTURE-MT Leaderboard
CULTURE-MT is a benchmark for evaluating culturally-aware social media translation.
Participants submit English translations for the benchmark inputs. Submissions are evaluated by a private Judger API, and the aggregated scores are displayed on this leaderboard.
Current Results
10 | CULTURE-MT-baseline-32B | ZJU & FDU & Xiaohongshu | Deepseek V3.2 | Base | 12.08 | 91.92 | 12.31 | 11.35 | 51.56 | 40.35 | 2.31 | 2026-05-27 22:38:47 |