Juan Pino

Research Scientist

I’m a research scientist at Facebook working on neural machine translation and language modeling. I obtained my PhD from the University of Cambridge under the supervision of Bill Byrne, where I developed a new model for extracting translation grammars from word alignment models. I also built translation systems that obtained the best automatic scores at WMT10 and WMT13 in the French-English and Russian-English tracks.


Machine translation, language modeling and natural language processing

Latest Publications

ACL - August 9, 2021

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Pino, Emmanuel Dupoux

IWSLT - August 2, 2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task

Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Pino, Holger Schwenk, Naman Goyal

ACL - August 1, 2021

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task

Yun Tang, Juan Pino, Xian Li, Changhan Wang, Dmitriy Genzel

ICASSP - June 6, 2021

A General Multi-task Learning Framework to Leverage Text Data for Speech to Text Tasks

Yun Tang, Juan Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel

AACL - December 4, 2020

FAIRSEQ S2T: Fast Speech-to-Text Modeling with FAIRSEQ

Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Pino

Interspeech - November 9, 2020

Self-Training for End-to-End Speech Translation

Juan Pino, Qiantong Xu, Xutai Ma, Mohammad Javad Dousti, Yun Tang

EMNLP - November 9, 2020

SIMULEVAL: An Evaluation Toolkit for Simultaneous Translation

Xutai Ma, Mohammad Javad Dousti, Changhan Wang, Jiatao Gu, Juan Pino

COLING - November 9, 2020

Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation

Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier

Interspeech - October 29, 2020

Self-Supervised Representations Improve End-to-End Speech Translation

Anne Wu, Changhan Wang, Juan Pino, Jiatao Gu

LREC - July 17, 2020

CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus

Changhan Wang, Juan Pino, Anne Wu, Jiatao Gu

ICASSP - May 7, 2020

SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

Arya D. McCarthy, Liezl Puzon, Juan Pino

ICLR - April 29, 2020

Monotonic Multihead Attention

Juan Pino, James Cross, Liezl Puzon, Jiatao Gu, Xutai Ma

WMT - November 25, 2019

Findings of the First Shared Task on Machine Translation Robustness

Xian Li, Paul Michel, Antonios Anastasopoulos, Yonatan Belinkov, Nadir Durrani, Orhan Firat, Philipp Koehn, Graham Neubig, Juan Pino, Hassan Sajjad

EMNLP - October 31, 2019

The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali–English and Sinhala–English

Francisco (Paco) Guzman, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato

WMT at ACL - August 2, 2019

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions

Philipp Koehn, Francisco (Paco) Guzman, Vishrav Chaudhary, Juan Pino

NAACL - June 10, 2019

On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models

Paul Michel, Xian Li, Graham Neubig, Juan Pino

ArXiv - November 24, 2018

Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications

Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Khudia, James Law, Parth Malani, Andrey Malevich, Satish Nadathur, Juan Pino, Martin Schatz, Alexander Sidorov, Viswanath Sivakumar, Andrew Tulloch, Xiaodong Wang, Yiming Wu, Hector Yuen, Utku Diril, Dmytro Dzhulgakov, Kim Hazelwood, Bill Jia, Yangqing Jia, Lin Qiao, Vijay Rao, Nadav Rotem, Sungjoo Yoo, Mikhail Smelyanskiy