Welcome to my home page!

I am a Computational Linguist focusing on understanding and modeling natural languages, with a special focus on Tamil and other low-resource languages, using computational methods.
I am a Senior Lecturer in Computer Science at the University of Jaffna, Sri Lanka. Additionally, I am a visiting researcher and a member of the Computational Linguistics Group in the Department of Linguistics at the University of Konstanz, Germany.

Education Professional
  • PhD (2022)
  • MSc in Computer Science (2010)
  • BSc (Hons) in Computer Science (2006)
  • Tamil Junior Pundit (Bala Pundit) (2016)

Ongoing Projects (Open for Collaboration)

  • Building Small Language Models
  • Building a Sri Lankan Tamil corpus
  • OCR/TTS for Tamil
  • Thirukkural Treebank
  • Building a Linguistically-Aware Tamil LLM

Recent news

  • Looking for two master's/PhD students for funded projects to work on LLMs and ASR for Sri Lankan Tamil.
  • Organising the second workshop on CHiPSAL: Challenges in Processing South Asian Languages at LREC2026.
  • Felicitated as an honorary DAAD Research Ambassador for the period 2025–2028 at the German Embassy in New Delhi on 17 November 2025.
  • Gave two invited talks at Mohammad Ali Jinnah University (MAJU), Karachi and UET Lahore on the topic of Challenges in Processing South Asian Languages: Insights from Tamil and Sinhala on the 5th and 12th of November 2025.
  • Organising a Summer School on Language Technology at the University of Jaffna, 29–30 June 2024 – Find out more
  • Published a paper on pre-tokenization/tokenization at COLING 2025 - Find it here
  • Successfully organised a workshop on Challenges in Processing South Asian Languages (CHiPSAL-2025) on Jan 19, 2025 at COLING 2025 - Proceedings is here
( Some news from 2024) )
Kengatharaiyer Sarveswaran


sarves at univ.jfn.ac.lk

Department of Computer Science
University of Jaffna
Sri Lanka

Plain Academic