Yenting Lin — Research Scientist, Google DeepMind

Experience

Google DeepMind: Research Scientist (audio LLMs, post-training, controllability)
Meta GenAI: Enhanced reasoning with stepwise feedback and long-form reasoning
NVIDIA Research: Error correction methods for multimodal language models
Amazon Alexa AI: Factuality evaluation agent and synthetic data techniques

News

2025: Released Step-KTO, stepwise binary feedback for mathematical reasoning.
2024: Measuring Taiwanese Mandarin Language Understanding accepted to COLM 2024.
2024: Released the latest Taiwan-LLM models, open-weight Traditional Chinese LLMs.
2023: Released Taiwan-LLM, the first open LLM series built for Taiwan.

Selected Publications

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
arXiv 2025

Measuring Taiwanese Mandarin Language Understanding
COLM 2024: Benchmark for Traditional Chinese LLM evaluation

LLM-Eval: Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with LLMs
NLP4ConvAI 2023

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information
EACL 2023

Knowledge-Grounded Conversational Data Augmentation with Generative Conversational Networks
SIGDIAL 2022

SalesBot: Transitioning from Chit-Chat to Task-Oriented Dialogues
ACL 2022

Open Source

Taiwan-LLM

Open-weight large language models built for Taiwanese Mandarin — pretraining data, instruction tuning, and evaluation for Traditional Chinese. Widely adopted by Taiwan's research community and industry.

Mistral-Small-Reasoning

Open-weight reasoning model distilled for efficient step-by-step problem solving.