kashmiri
Here are 7 public repositories matching this topic...
Script Normalization for Unconventional Writing of Perso-Arabic scripts (ACL2023)
-
Updated
Jul 9, 2023 - PLSQL
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🔠️🔢️ The linguistic:Kashmiri category for AI2001, containing Kashmiri language linguistic datasets
-
Updated
Jul 30, 2023 - R
This project implements a Byte Pair Encoding (BPE) tokenizer trained on Kashmiri poetry written in the Latin script. The corpus is derived from the work of Abdul Ahad Azaad, a prominent revolutionary Kashmiri poet of the 20th century.
-
Updated
Mar 2, 2026 - Python
A Python library designed for normalizing Kashmiri text (Persio-Arabic script). This tool standardizes text by handling character variations, consistent punctuation spacing, and digit conversion. It is optimized for Natural Language Processing (NLP) pipelines and Machine Learning data preprocessing.
-
Updated
Feb 14, 2026 - Python
Improve this page
Add a description, image, and links to the kashmiri topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the kashmiri topic, visit your repo's landing page and select "manage topics."