Lanka Data Net (LDN) is a pioneer in structured data collection and intelligent data retrieval in Sri Lanka. It has built extensive, sector-focused repositories spanning legal, taxation, business, and other critical domains, comprising over one billion datasets. These datasets are systematically curated, standardized, and continuously maintained to ensure accuracy, depth, and long-term reliability.

Building on this strong data foundation, LankaData has developed specialized AI-powered domain experts using advanced Retrieval-Augmented Generation (RAG) models. This enables precise, context-aware access to authoritative information, setting a national benchmark for intelligent, data-driven knowledge retrieval. These systems are seamlessly integrated with a fined-tuned LLM and delivered through Chat2Find, positioning it as Sri Lankaโ€™s official AI layer for accurate, structured, and reliable data access.

Lanka Data brings together a powerful Sri Lanka-focused data corpus exceeding 10 billion data points, forming the foundation for accurate trilingual capabilities across Sinhala, Tamil, and English. This is complemented by high-quality conversational and structured datasets spanning key domains enabling precise, context-aware insights for real-world applications.

Data Corpus

Powering the future of Sri Lankan AI, Chat2Findโ€™s data corpus spans over  255 million tokens, creating an unmatched foundation for truly intelligent, trilingual experiences across Sinhala, Tamil, and English.

Conversational Data

With deeply structured datasets across critical sectors like law, taxation, and business, Chat2Find delivers precision, reliability, and real-world intelligence that goes beyond generic AI systems.

Data Archives

LankaData provides a comprehensive suite of structured, high-quality datasets covering Sri Lankaโ€™s legal, economic, taxation, business and regulatory landscape for direct download. 

At its core, the Chat2Find base model is a robust large language model trained on extensive localized data, delivering strong multilingual understanding, while its fine-tuned trilingual models further enhance performance by capturing linguistic nuances and cultural context, resulting in highly natural, accurate, and reliable AI interactions.

Base Model

At the heart of Chat2Find lies a robust base model pre-trained on rich, localized data corpus enabling powerful multilingual understanding tailored specifically for Sri Lanka.

Fine Tuned LLM

Refined to perfection, Chat2Findโ€™s fine-tuned models capture linguistic nuance and cultural depth, delivering seamless, natural, and highly accurate interactions in Sinhala, Tamil, and English.

LankaData delivers real-time data retrieval through intelligent layers built across legal, taxation, business, and other key sectors. Using advanced Retrieval-Augmented Generation (RAG) models, these layers dynamically fetch and interpret authoritative data at query time, ensuring accurate, context-aware, and up-to-date responses seamlessly delivered through AI interface. All Expert models are now accessible through the AIMart mobile app, available on Google Play.

LankaLaw

Digital repository of Sri Lankan legislation, case law, and legal resources with responses delivered seamlessly through AI interface

LankaTax

Digital repository of Sri Lankan tax laws, regulations, forms, and guidance, with responses delivered seamlessly through AI interface

LankaBIZ

Digital repository of Sri Lankan business data, including company profiles and disclosures,delivered seamlessly through AI interface.




LankaData and Chat2Find have collaborated to create Sri Lankaโ€™s first intelligent AI layer for accessing public data, combining LankaDataโ€™s vast, structured repositories with Chat2Findโ€™s advanced language model capabilities. By integrating high-quality datasets across legal, taxation, business, and other critical domains with Retrieval-Augmented Generation (RAG), this partnership enables precise, context-aware, and real-time information retrieval. The result is a powerful, user-friendly interface that transforms how individuals, businesses, and institutions access and interact with authoritative public data-setting a new national benchmark for transparency, efficiency, and data-driven decision-making.