BASE DATA
Open Source Data Sets
Lanka Data brings together a Sri Lanka-focused data corpus exceeding 10 billion data points, forming the foundation for accurate trilingual capabilities across Sinhala, Tamil, and English. It is complemented by high-quality conversational and structured datasets spanning key domains, enabling precise, context-aware insights for real-world applications.
Conversational Data
With deeply structured datasets across critical sectors like law, taxation, and business, Chat2Find delivers precision, reliability, and real-world intelligence that goes beyond generic AI systems.
BASE MODELS
Open Source Intelligent Layer
At its core, the Chat2Find base model is a large language model trained on extensive localized data, which gives it strong multilingual understanding. Fine-tuned trilingual models build on it by capturing linguistic nuance and cultural context, resulting in natural, accurate, and reliable AI interactions.

Base Model
At the heart of Chat2Find lies a robust base model pre-trained on a rich, localized data corpus, enabling powerful multilingual understanding tailored specifically for Sri Lanka.

Fine-Tuned LLM
Refined to perfection, Chat2Find's fine-tuned models capture linguistic nuance and cultural depth, delivering seamless, natural, and highly accurate interactions in Sinhala, Tamil, and English.
