Home - Lanka Data

BASE DATA

Open Source Data Sets

Lanka Data brings together a powerful Sri Lanka-focused data corpus exceeding 10 billion data points, forming the foundation for accurate trilingual capabilities across Sinhala, Tamil, and English. This is complemented by high-quality conversational and structured datasets spanning key domains enabling precise, context-aware insights for real-world applications.

Explore Data Sets

Data Corpus

Powering the future of Sri Lankan AI, Chat2Find’s data corpus spans over 255 million tokens, creating an unmatched foundation for truly intelligent, trilingual experiences across Sinhala, Tamil, and English.

Conversational Data

With deeply structured datasets across critical sectors like law, taxation, and business, Chat2Find delivers precision, reliability, and real-world intelligence that goes beyond generic AI systems.

Data Archives

LankaData provides a comprehensive suite of structured, high-quality datasets covering Sri Lanka’s legal, economic, taxation, business and regulatory landscape for direct download.

BASE MODELS

Open Source Intelligent Layer

At its core, the Chat2Find base model is a robust large language model trained on extensive localized data, delivering strong multilingual understanding, while its fine-tuned trilingual models further enhance performance by capturing linguistic nuances and cultural context, resulting in highly natural, accurate, and reliable AI interactions.

Explore Core Model

Base Model

At the heart of Chat2Find lies a robust base model pre-trained on rich, localized data corpus enabling powerful multilingual understanding tailored specifically for Sri Lanka.

Fine Tuned LLM

Refined to perfection, Chat2Find’s fine-tuned models capture linguistic nuance and cultural depth, delivering seamless, natural, and highly accurate interactions in Sinhala, Tamil, and English.

RAG MODELS

Retrieval-Augmented Generation (RAG) models

LankaData delivers real-time data retrieval through intelligent layers built across legal, taxation, business, and other key sectors. Using advanced Retrieval-Augmented Generation (RAG) models, these layers dynamically fetch and interpret authoritative data at query time, ensuring accurate, context-aware, and up-to-date responses seamlessly delivered through AI interface. All Expert models are now accessible through the AIMart mobile app, available on Google Play.

Explore RAG Models

LankaLaw

Digital repository of Sri Lankan legislation, case law, and legal resources with responses delivered seamlessly through AI interface

LankaTax

Digital repository of Sri Lankan tax laws, regulations, forms, and guidance, with responses delivered seamlessly through AI interface

LankaBIZ

Digital repository of Sri Lankan business data, including company profiles and disclosures,delivered seamlessly through AI interface.

Stay Update with LankaData updates

get in touch

We’d love to hear from you anytime

46/46 Nawam Mawatha, Colombo Sri Lanka

support@lankadata.net

+94 (0) 777 3400 35

LankaData and Chat2Find have collaborated to create Sri Lanka’s first intelligent AI layer for accessing public data, combining LankaData’s vast, structured repositories with Chat2Find’s advanced language model capabilities. By integrating high-quality datasets across legal, taxation, business, and other critical domains with Retrieval-Augmented Generation (RAG), this partnership enables precise, context-aware, and real-time information retrieval. The result is a powerful, user-friendly interface that transforms how individuals, businesses, and institutions access and interact with authoritative public data-setting a new national benchmark for transparency, efficiency, and data-driven decision-making.

Lanka Data

Access Data
with Artificial
Intelligence.

Building Networks with LankaData.

Accelerate Your
Project with Structured Data.

BASE DATA

Open Source Data Sets

Data Corpus

Conversational Data

Data Archives

BASE MODELS

Open Source Intelligent Layer

Base Model

Fine Tuned LLM

RAG MODELS

Retrieval-Augmented Generation (RAG) models

LankaLaw

LankaTax

LankaBIZ

latest news

Stay Update with LankaData updates

Chat2Find Publishes 255M+ Token Trilingual AI Corpus on Hugging Face and LankaData

Free Access to Sri Lanka’s Legal Data Expanded Through LankaData Network and LankaLaw Collaboration

Chat2Find Publishes 255M+ Token Trilingual AI Corpus on Hugging Face and LankaData

We’d love to hear from you anytime

Access Data with ArtificialIntelligence.

Building Networks with LankaData.

Accelerate YourProject with Structured Data.

BASE DATA

Open Source Data Sets

Data Corpus

Conversational Data

Data Archives

BASE MODELS

Open Source Intelligent Layer

Base Model

Fine Tuned LLM

RAG MODELS

Retrieval-Augmented Generation (RAG) models

LankaLaw

LankaTax

LankaBIZ

latest news

Stay Update with LankaData updates

Chat2Find Publishes 255M+ Token Trilingual AI Corpus on Hugging Face and LankaData

Free Access to Sri Lanka’s Legal Data Expanded Through LankaData Network and LankaLaw Collaboration

Chat2Find Publishes 255M+ Token Trilingual AI Corpus on Hugging Face and LankaData

We’d love to hear from you anytime

Access Data
with Artificial
Intelligence.

Accelerate Your
Project with Structured Data.