Welcome to my home page!

I am a Computational Linguist focusing on understanding and modeling natural languages, with a special focus on Tamil and other low-resource languages, using computational methods.
I am a Senior Lecturer in Computer Science at the University of Jaffna, Sri Lanka. Additionally, I am a visiting researcher and a member of the Computational Linguistics Group in the Department of Linguistics at the University of Konstanz, Germany.

Education Professional
  • PhD (2022)
  • MSc in Computer Science (2010)
  • BSc (Hons) in Computer Science (2006)
  • Tamil Junior Pundit (Bala Pundit) (2016)

Ongoing Projects (Open for Collaboration)

  • Building a Sri Lankan Tamil corpus
  • OCR/TTS for Tamil
  • Thirukkural Treebank
  • Building a Linguistically-Aware Tamil LLM

Recent news

  • Published a paper on pre-tokenization/tokenization at COLING 2025 - Find it here
  • Successfully organised a workshop on Challenges in Processing South Asian Languages (CHiPSAL-2025) on Jan 19, 2025 at COLING 2025 - Proceedings is here
( Some news from 2024) )
Kengatharaiyer Sarveswaran


sarves at univ.jfn.ac.lk

Department of Computer Science
University of Jaffna
Sri Lanka

Plain Academic