
One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers
multilingual
Language Models
Pre-Training
Efficiency
The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It
multilingual
Safety
survey
The Multilingual Divide and Its Impact on Global AI Safety
multilingual
Safety
AI Policy
How to Improve the Robustness of Closed-Source Models on NLI
Language Models
Robustness
Collaboration
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects
Evaluation
AI Policy
Collaboration
Reverse Engineering Human Preferences with Reinforcement Learning
Evaluation
Reinforcement Learning
Reasoning
Aya Vision: Advancing the Frontier of Multilingual Multimodality
multilingual
Language Models
Multimodal
Crosslingual Reasoning through Test-Time Scaling
Reasoning
multilingual
Language Models
The Leaderboard Illusion
Evaluation
Language Models
Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation
multilingual
Evaluation
Language Models
Kaleidoscope: Exams for Multilingual Vision Evaluation
Evaluation
Open Source
multilingual
Generative Models
Multimodal
Command A: An Enterprise-Ready Large Language Model
Language Models
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions
Code
Collaboration
Evaluation
Reasoning
Tooling
Policy Primer - Efficient AI
AI Policy
Compute
Data Efficiency
Model Compression
Fairness of Deep Ensembles: On the interplay between per-group task difficulty and under-representation
Computer Vision
Responsible AI
Policy Primer - Translating Safety
AI Policy
multilingual
Safety
If You Can't Use Them, Recycle Them
Language Models
Optimization