Shaoxiong is a principal investigator at ELLIS Institute Finland and an assistant professor at the University of Turku, Finland. Prior to this, he was an independent research group leader at the Technical University of Darmstadt and a postdoctoral researcher at the University of Helsinki, working on high-performance language technology. He obtained his Ph.D. from Aalto University.

Research Group

Our research group at the ELLIS Institute Finland & TurkuNLP, University of Turku focuses on advancing natural language processing and machine learning with applications in healthcare and multilingual scenarios. We work on cutting-edge topics including LLM post-training, multilingual NLP for low-resource languages, and AI for healthcare.

We are committed to:

    Developing robust ML and NLP systems for diverse tasks and domains

    Creating multilingual LLMs that serve underrepresented languages

    Advancing AI applications in healthcare with a focus on trustworthiness and fairness

    Training the next generation of researchers in AI and NLP

Meet our people →

We welcome Master's students looking for thesis opportunities, as well as visiting students and researchers in NLP and related fields, to work with us.


What's New

We are now seeking postdoctoral researchers to join the ELLIS Institute Finland. Apply here

Jan 12, 2026

Thrilled to have Renhao Pei join the team as a doctoral researcher – welcome aboard!

Jan 12, 2026

Call for Participation - PsyDefDetect Shared Task at BioNLP, ACL 2026 for Detecting Psychological Defense Mechanisms in Conversations

Starts on March 15, 2026 @BioNLP, ACL 2026

One paper accepted by EACL 2026 on reasoning models for MT

January 4, 2026 @Rabat, Morocco

Thrilled to join ELLIS Institute Finland as a Principal Investigator hosted at the University of Turku!

October 1, 2025 @Finland

We release EMMA-500 Llama 3/3.1 models and MaLA bilingual translation corpus in 2,500+ language pairs

June 2025

We release a series of CPT models that study the data mixing in continual pre-training 🤗

April 2025

We release the preview of GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models

April 2025

Big congrats to Zihao Li on starting PhD research!

March 1, 2025 @Helsinki

Call for Participation - SemEval-2025 Task-3: Mu-SHROOM, the Multilingual Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

Deadline: Jan 31, 2025 @SemEval-2025

Thrilled to have Doan Nam Long Vu join the team – welcome aboard!

January 2, 2025 @Darmstadt

Check out our latest survey paper on LLMs for graph learning

January 2, 2025