
Hoyeon Lee

Research Scientist & Tech Lead · NAVER Cloud

NAVER Green Factory, 6, Buljeong-ro,
Seongnam, 13561, Republic of Korea


I am a Senior Research Scientist and Tech Lead in the Voice team at NAVER Cloud.

My research focuses on Natural Language Processing (NLP) and speech-oriented language processing for Text-to-Speech (TTS), with an emphasis on large language models (LLMs). Recently, I have been working on multilingual and cross-lingual representation learning, as well as LLM-driven methods for data generation and evaluation in speech and TTS pipelines, with a focus on scalable and robust modeling for real-world deployment.

I have developed large-scale multilingual language models covering a wide range of languages, including Korean and French, as well as cross-lingual settings, and deployed them in AI products actively used across NAVER’s services. My work has been presented at top-tier venues, including EMNLP and INTERSPEECH.


news

Apr 2026 I joined AI·SW Maestro as a Technical Mentor. Starting this year, I mentor trainees in building production-grade AI systems across language modeling, speech AI, and multimodal intelligence.
Aug 2025 Our paper “Synthetic Data Generation for Phrase Break Prediction with Large Language Model” was accepted to INTERSPEECH 2025 in Rotterdam 🇳🇱.
Nov 2024 We presented our work “A Two-Step Approach for Data-Efficient French Pronunciation Learning” at EMNLP 2024 in Miami 🇺🇸.
Aug 2023 I gave a talk on our accepted paper at INTERSPEECH 2023, with slides released here! ✨
Jun 2023 I will give a talk on our accepted paper “Lightweight Grapheme-to-Phoneme Conversion Based on Knowledge Distilled BERT” at the Summer Annual Conference of IEIE, to be held in Jeju. ✨